Skip to main content
GET
/
v1
/
documents
/
{document_id}
/
content
Get extracted content
const options = {method: 'GET', headers: {Authorization: 'Bearer <token>'}};

fetch('https://api.factify.com/v1/documents/{document_id}/content', options)
  .then(res => res.json())
  .then(res => console.log(res))
  .catch(err => console.error(err));
{
  "version_id": "ver_01h2xcejqtf2nbrexx3vqjhp41",
  "source_type": "pdf",
  "document": {},
  "spreadsheet": {},
  "layout": {}
}

Documentation Index

Fetch the complete documentation index at: https://developers.factify.com/llms.txt

Use this file to discover all available pages before exploring further.

Authorizations

Authorization
string
header
required

Bearer authentication using a factapi-issued API key (ffy_<env>_<base32_uuid><base62_random>). Cookie-based sessions are accepted automatically by user-facing endpoints but are not surfaced as an OpenAPI auth scheme.

Path Parameters

document_id
string
required

Document ID

Example:

"doc_01h2xcejqtf2nbrexx3vqjhp41"

Response

OK

Extracted text and structured content for a document version. The document, spreadsheet, and layout fields are mutually exclusive — exactly one is populated based on source_type. Only available once the version reaches ready status.

version_id
string
required

ID of the version this content was extracted from.

Example:

"ver_01h2xcejqtf2nbrexx3vqjhp41"

source_type
string
required

Origin format of the version, controls which content field is populated. pdf/docx/markdown populate document; xlsx/csv populate spreadsheet.

Example:

"pdf"

document
object

Document-format content (PDF/DOCX/Markdown). Free-form structured object — schema depends on the extractor version.

spreadsheet
object

Spreadsheet-format content (XLSX/CSV). Free-form structured object — schema depends on the extractor version.

layout
object

Optional per-page layout metadata (bounding boxes, reading order). Present when the extractor produced layout output.