Extraction Agents
Create Extraction Agent
List Extraction Agents
Get Extraction Agent
Delete Extraction Agent
Update Extraction Agent
Models
ExtractAgent = object { id, config, data_schema, 5 more }
Schema and configuration for an extraction agent.
id: string
The id of the extraction agent.
config: object
The configuration parameters for the extraction agent.
chunk_mode: optional "PAGE" or "SECTION"
The mode to use for chunking the document.
citation_bbox: optional boolean (deprecated)
Whether to fetch citation bounding boxes for the extraction. Only available in PREMIUM mode. Deprecated: this is now synonymous with cite_sources.
cite_sources: optional boolean
Whether to cite sources for the extraction.
confidence_scores: optional boolean
Whether to fetch confidence scores for the extraction.
extract_model: optional "openai-gpt-4-1" or "openai-gpt-4-1-mini" or "openai-gpt-4-1-nano" or 8 more or string
The extract model to use for data extraction. If not provided, uses the default for the extraction mode.
ExtractModels = "openai-gpt-4-1" or "openai-gpt-4-1-mini" or "openai-gpt-4-1-nano" or 8 more
Extract model options.
extraction_mode: optional "FAST" or "BALANCED" or "PREMIUM" or "MULTIMODAL"
The extraction mode specified (FAST, BALANCED, MULTIMODAL, PREMIUM).
extraction_target: optional "PER_DOC" or "PER_PAGE" or "PER_TABLE_ROW"
The extraction target specified.
high_resolution_mode: optional boolean
Whether to use high resolution mode for the extraction.
invalidate_cache: optional boolean
Whether to invalidate the cache for the extraction.
multimodal_fast_mode: optional boolean
DEPRECATED: Whether to use fast mode for multimodal extraction.
num_pages_context: optional number
Number of pages to pass as context on long document extraction.
page_range: optional string
Comma-separated list of page numbers or ranges to extract from (1-based, e.g., '1,3,5-7,9' or '1-3,8-10').
parse_model: optional "openai-gpt-4o" or "openai-gpt-4o-mini" or "openai-gpt-4-1" or 23 more
The parse model to use, given as one of the public model names.
priority: optional "low" or "medium" or "high" or "critical"
The priority for the request. This field may be ignored or overwritten depending on the organization tier.
system_prompt: optional string
The system prompt to use for the extraction.
use_reasoning: optional boolean
Whether to use reasoning for the extraction.
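The page_range format described above (comma-separated, 1-based page numbers or ranges) can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: parse_page_range is a hypothetical helper, and rejecting empty entries and reversed ranges is an assumption about what counts as invalid input.

```python
def parse_page_range(spec: str) -> list[int]:
    """Expand a 1-based page spec such as '1,3,5-7,9' into sorted page numbers.

    Hypothetical client-side helper; the service may validate differently.
    """
    pages: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            raise ValueError(f"empty entry in page range: {spec!r}")
        if "-" in part:
            # A range such as '5-7' covers both endpoints.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(part)
        if start < 1 or end < start:
            raise ValueError(f"invalid page range entry: {part!r}")
        pages.update(range(start, end + 1))
    return sorted(pages)


print(parse_page_range("1,3,5-7,9"))  # [1, 3, 5, 6, 7, 9]
print(parse_page_range("1-3,8-10"))   # [1, 2, 3, 8, 9, 10]
```

Returning a sorted, de-duplicated list keeps overlapping entries such as '1-3,2-4' from producing repeated pages.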
data_schema: map[map[unknown] or array of unknown or string or 2 more]
The schema of the data.
name: string
The name of the extraction agent.
project_id: string
The ID of the project that the extraction agent belongs to.
created_at: optional string
The creation time of the extraction agent.
custom_configuration: optional "default"
Custom configuration type for the extraction agent. Currently supports 'default'.
updated_at: optional string
The last update time of the extraction agent.
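As an illustration of how the fields above fit together, here is a hedged sketch that validates the enumerated config values and assembles an ExtractAgent-shaped dictionary. The allowed values are copied from the field descriptions. Everything else is an assumption made for the example: the check_config helper, the id, name, and project_id values, the JSON-Schema-style data_schema, and the choice to fold the deprecated citation_bbox flag into cite_sources on the client.

```python
from typing import Any

# Allowed values as listed in the field descriptions above.
ALLOWED_VALUES = {
    "chunk_mode": {"PAGE", "SECTION"},
    "extraction_mode": {"FAST", "BALANCED", "PREMIUM", "MULTIMODAL"},
    "extraction_target": {"PER_DOC", "PER_PAGE", "PER_TABLE_ROW"},
    "priority": {"low", "medium", "high", "critical"},
}


def check_config(config: dict[str, Any]) -> dict[str, Any]:
    """Return a validated copy of a config dictionary (hypothetical helper)."""
    cleaned = dict(config)
    for field, allowed in ALLOWED_VALUES.items():
        value = cleaned.get(field)
        if value is not None and value not in allowed:
            raise ValueError(f"{field} must be one of {sorted(allowed)}, got {value!r}")
    # Assumption: since citation_bbox is documented as synonymous with
    # cite_sources, carry its value over unless cite_sources is already set.
    if "citation_bbox" in cleaned:
        bbox = cleaned.pop("citation_bbox")
        cleaned.setdefault("cite_sources", bbox)
    return cleaned


# Illustrative values only; none of these identifiers come from the API.
agent = {
    "id": "agent-123",
    "name": "invoice-extractor",
    "project_id": "project-456",
    "config": check_config(
        {
            "extraction_mode": "PREMIUM",
            "extraction_target": "PER_DOC",
            "citation_bbox": True,
            "page_range": "1-3",
        }
    ),
    # Assumption: data_schema is written here in JSON Schema style.
    "data_schema": {
        "type": "object",
        "properties": {"total": {"type": "number"}},
    },
}

print(agent["config"]["cite_sources"])  # True
```

Checking enumerated values locally surfaces typos such as "Premium" before a request is made; the service remains the authority on what it accepts.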