Skip to main content

Extract — Overview

The Extract workflow turns uploaded documents into editable, reviewable, and reusable project knowledge.

What Extract Does

Extraction in Engenerate is not a black-box process. When a document is processed, the results are immediately available for review, editing, and correction before they become part of your working knowledge base. This review-first design means you stay in control of what goes into your project.

Supported File Types

TypeFormats
DocumentsPDF (including scanned), Word (.docx), PowerPoint (.pptx)
SpreadsheetsExcel (.xlsx), CSV
TextPlain text (.txt), Markdown (.md)
ImagesCommon image formats (PNG, JPG, etc.)

Scanned PDFs are fully supported. Engenerate processes image-based documents through OCR, so scanned reports, drawings, and legacy records are handled alongside native digital files without any pre-processing step.

Uploading Files

Files can be added from the Documents tab inside a project's Resources panel.

You can:

  • Click the upload button and select one or more files
  • Drag and drop files directly into the Documents area
  • Upload multiple files in a single batch

Before uploading, you can optionally add:

  • A description for the document
  • Tags to help organize large projects
  • A workspace association to connect the document to a specific working environment

What Happens After Upload

Once a file is uploaded, Engenerate queues it for processing. Processing happens automatically in the background. You can see the current status of each document in the Documents list.

StatusMeaning
QueuedWaiting to be processed
ProcessingExtraction is in progress
ReadyProcessing is complete; content is available for review
ErrorProcessing encountered a problem; reprocessing is available

When processing completes, the document becomes immediately available for review, editing, and use in chat.

Reprocessing

If extraction results are unsatisfactory, you can reprocess a document. Reprocessing is useful when:

  • The first-pass extraction missed content or produced errors
  • A newer processing workflow is available
  • A figure or layout needs a fresh pass after edits

Reprocessing replaces the current extraction results while preserving the original source file.