Extract — Overview
The Extract workflow turns uploaded documents into editable, reviewable, and reusable project knowledge.
What Extract Does
Extraction in Engenerate is not a black-box process. When a document is processed, the results are immediately available for review, editing, and correction before they become part of your working knowledge base. This review-first design means you stay in control of what goes into your project.
Supported File Types
| Type | Formats |
|---|---|
| Documents | PDF (including scanned), Word (.docx), PowerPoint (.pptx) |
| Spreadsheets | Excel (.xlsx), CSV |
| Text | Plain text (.txt), Markdown (.md) |
| Images | Common image formats (PNG, JPG, etc.) |
Scanned PDFs are fully supported. Engenerate processes image-based documents through OCR, so scanned reports, drawings, and legacy records are handled alongside native digital files without any pre-processing step.
Uploading Files
Files can be added from the Documents tab inside a project's Resources panel.
You can:
- Click the upload button and select one or more files
- Drag and drop files directly into the Documents area
- Upload multiple files in a single batch
Before uploading, you can optionally add:
- A description for the document
- Tags to help organize large projects
- A workspace association to connect the document to a specific working environment
What Happens After Upload
Once a file is uploaded, Engenerate queues it for processing. Processing happens automatically in the background. You can see the current status of each document in the Documents list.
| Status | Meaning |
|---|---|
| Queued | Waiting to be processed |
| Processing | Extraction is in progress |
| Ready | Processing is complete; content is available for review |
| Error | Processing encountered a problem; reprocessing is available |
When processing completes, the document becomes immediately available for review, editing, and use in chat.
Reprocessing
If extraction results are unsatisfactory, you can reprocess a document. Reprocessing is useful when:
- The first-pass extraction missed content or produced errors
- A newer processing workflow is available
- A figure or layout needs a fresh pass after edits
Reprocessing replaces the current extraction results while preserving the original source file.