Extract
Create Extract Job
List Extract Jobs
Get Extract Job
Delete Extract Job
Validate Extraction Schema
Generate Extraction Schema
Models
class ExtractConfiguration:
Extract configuration combining parse and extract settings.
Optional<String> parseConfigId
Saved parse configuration ID that controls how the document is parsed before extraction.
Optional<String> parseTier
Parse tier to use before extraction. Defaults to the extract tier if not specified.
Optional<String> targetPages
Comma-separated page numbers or ranges to process (1-based). Omit to process all pages.
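To illustrate the targetPages format, here is a minimal sketch that expands a comma-separated spec into individual 1-based page numbers. The class TargetPages and its parse method are hypothetical helpers, not part of the SDK, and the "start-end" range syntax is an assumption based on the description above; the service may accept or reject inputs differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the SDK.
public class TargetPages {
    /** Expands a spec such as "1,3-5" into individual 1-based page numbers. */
    public static List<Integer> parse(String spec) {
        List<Integer> pages = new ArrayList<>();
        for (String part : spec.split(",")) {
            String token = part.trim();
            if (token.isEmpty()) {
                throw new IllegalArgumentException("Empty page token in: " + spec);
            }
            // Assumption: a range is written as "start-end", inclusive on both ends.
            int dash = token.indexOf('-');
            int start = Integer.parseInt(dash < 0 ? token : token.substring(0, dash).trim());
            int end = dash < 0 ? start : Integer.parseInt(token.substring(dash + 1).trim());
            if (start < 1 || end < start) {
                throw new IllegalArgumentException("Invalid page range: " + token);
            }
            for (int p = start; p <= end; p++) {
                pages.add(p);
            }
        }
        return pages;
    }
}
```

Validating the string on the client before creating a job can surface typos early, but the server remains the authority on what it accepts.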
class ExtractJobMetadata:
Extraction metadata.
Metadata for extracted fields including document, page, and row level info.
Optional<DocumentMetadata> documentMetadata
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.
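The nesting described above (scalar fields, arrays indexed by position, nested objects) can be walked recursively. The following is a rough sketch over generic Map and List values rather than the SDK's own types; FieldMetadataWalker is a hypothetical name, and treating any map with a numeric "confidence" key as a leaf FieldMetadataEntry is an assumption made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper operating on plain maps and lists, not SDK types.
public class FieldMetadataWalker {
    /** Collects confidence values keyed by a path such as "items[0].price". */
    public static Map<String, Double> confidences(Map<String, Object> fieldMetadata) {
        Map<String, Double> out = new LinkedHashMap<>();
        for (Map.Entry<String, Object> e : fieldMetadata.entrySet()) {
            walk(e.getKey(), e.getValue(), out);
        }
        return out;
    }

    private static void walk(String path, Object node, Map<String, Double> out) {
        if (node instanceof List<?>) {
            // Array field: one element per array position.
            List<?> list = (List<?>) node;
            for (int i = 0; i < list.size(); i++) {
                walk(path + "[" + i + "]", list.get(i), out);
            }
        } else if (node instanceof Map<?, ?>) {
            Map<?, ?> map = (Map<?, ?>) node;
            Object confidence = map.get("confidence");
            if (confidence instanceof Number) {
                // Assumption: a numeric "confidence" key marks a leaf entry.
                out.put(path, ((Number) confidence).doubleValue());
                return;
            }
            // Nested object: recurse into each sub-field.
            for (Map.Entry<?, ?> e : map.entrySet()) {
                walk(path + "." + e.getKey(), e.getValue(), out);
            }
        }
    }
}
```

Flattening to paths makes it straightforward to flag low-confidence fields for review without knowing the schema in advance.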
class ExtractV2Job:
An extraction job.
String status
Current job status.
PENDING: queued, not yet started
RUNNING: actively processing
COMPLETED: finished successfully
FAILED: terminated with an error
CANCELLED: cancelled by user
Extract configuration combining parse and extract settings.
Optional<String> parseConfigId
Saved parse configuration ID that controls how the document is parsed before extraction.
Optional<String> parseTier
Parse tier to use before extraction. Defaults to the extract tier if not specified.
Optional<String> targetPages
Comma-separated page numbers or ranges to process (1-based). Omit to process all pages.
Extraction metadata.
Metadata for extracted fields including document, page, and row level info.
Optional<DocumentMetadata> documentMetadata
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.
Optional<ExtractResult> extractResult
Extracted data conforming to the data_schema. Returns a single object for per_doc, or an array for per_page / per_table_row.
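When polling a job, it helps to know which status values mean the job will not change again. The sketch below mirrors the five documented status strings in a local enum; JobStatusCheck and its Status enum are hypothetical, and classifying COMPLETED, FAILED, and CANCELLED as terminal is inferred from their descriptions rather than stated by the API.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical helper; the SDK exposes status as a plain String.
public class JobStatusCheck {
    public enum Status { PENDING, RUNNING, COMPLETED, FAILED, CANCELLED }

    // Assumption: these three states are final and the job will not progress further.
    private static final Set<Status> TERMINAL =
            EnumSet.of(Status.COMPLETED, Status.FAILED, Status.CANCELLED);

    /** Returns true once polling can stop. Throws IllegalArgumentException for unknown values. */
    public static boolean isTerminal(String status) {
        return TERMINAL.contains(Status.valueOf(status));
    }
}
```

A polling loop would typically call isTerminal on each fetched status and read extractResult only when the status is COMPLETED.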
class ExtractV2JobCreate:
Request to create an extraction job. Provide configuration_id or inline configuration.
Extract configuration combining parse and extract settings.
Optional<String> parseConfigId
Saved parse configuration ID that controls how the document is parsed before extraction.
Optional<String> parseTier
Parse tier to use before extraction. Defaults to the extract tier if not specified.
Optional<String> targetPages
Comma-separated page numbers or ranges to process (1-based). Omit to process all pages.
Optional<List<WebhookConfiguration>> webhookConfigurations
Outbound webhook endpoints to notify on job status changes.
Optional<List<WebhookEvent>> webhookEvents
Events to subscribe to (e.g. 'parse.success', 'extract.error'). If null, all events are delivered.
Optional<WebhookHeaders> webhookHeaders
Custom HTTP headers sent with each webhook request (e.g. auth tokens).
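The rule that a null webhookEvents list delivers every event can be expressed as a small predicate. This is only an illustration of the documented semantics, not the service's implementation; WebhookEventFilter is a hypothetical name, and the behavior for an empty (non-null) list is an assumption, since the description above does not cover it.

```java
import java.util.List;

// Hypothetical illustration of the subscription rule, not SDK code.
public class WebhookEventFilter {
    /** Returns true if an event with this name would be delivered. */
    public static boolean shouldDeliver(List<String> subscribedEvents, String event) {
        // null means no filter was set, so every event is delivered.
        // Assumption: an empty list matches nothing.
        return subscribedEvents == null || subscribedEvents.contains(event);
    }
}
```

For example, subscribing only to 'extract.error' would skip 'parse.success' notifications under this reading.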
class ExtractV2JobQueryResponse:
Paginated list of extraction jobs.
List<ExtractV2Job> items
The list of items.
String status
Current job status.
PENDING: queued, not yet started
RUNNING: actively processing
COMPLETED: finished successfully
FAILED: terminated with an error
CANCELLED: cancelled by user
Extract configuration combining parse and extract settings.
Optional<String> parseConfigId
Saved parse configuration ID that controls how the document is parsed before extraction.
Optional<String> parseTier
Parse tier to use before extraction. Defaults to the extract tier if not specified.
Optional<String> targetPages
Comma-separated page numbers or ranges to process (1-based). Omit to process all pages.
Extraction metadata.
Metadata for extracted fields including document, page, and row level info.
Optional<DocumentMetadata> documentMetadata
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.
Optional<ExtractResult> extractResult
Extracted data conforming to the data_schema. Returns a single object for per_doc, or an array for per_page / per_table_row.
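Because extractResult is a single object for per_doc but an array for per_page and per_table_row, code that handles all three may want one uniform shape. The sketch below assumes the result has already been deserialized into plain Java values (a Map for an object, a List for an array); ExtractResultShape is a hypothetical helper and not an SDK type.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper working on already-deserialized values.
public class ExtractResultShape {
    /** Returns the result as a list: arrays are copied, a single object is wrapped. */
    public static List<Object> asList(Object extractResult) {
        if (extractResult == null) {
            // No result yet, for example while the job is still running.
            return Collections.emptyList();
        }
        if (extractResult instanceof List<?>) {
            // per_page / per_table_row: one element per page or row.
            return new ArrayList<Object>((List<?>) extractResult);
        }
        // per_doc: a single object.
        return Collections.singletonList(extractResult);
    }
}
```

Downstream code can then iterate over the list regardless of which mode produced the result.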
class ExtractedFieldMetadata:
Metadata for extracted fields including document, page, and row level info.
Optional<DocumentMetadata> documentMetadata
Per-field metadata keyed by field name from your schema. Scalar fields (e.g. vendor) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. items) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.