Beta
BetaAgent Data
Get Agent Data
Update Agent Data
Delete Agent Data
Create Agent Data
Search Agent Data
Aggregate Agent Data
Delete Agent Data By Query
ModelsExpand Collapse
AgentData = object { data, deployment_name, id, 4 more }
API Result for a single agent data item
BetaParse Configurations
Create Parse Configuration
List Parse Configurations
Get Parse Configuration
Update Parse Configuration
Delete Parse Configuration
ModelsExpand Collapse
ParseConfiguration = object { id, created_at, name, 6 more }
Parse configuration schema.
id: string
Unique identifier for the parse configuration
created_at: string
Creation timestamp
name: string
Name of the parse configuration
parameters: LlamaParseParameters { adaptive_long_table, aggressive_table_extraction, annotate_links, 115 more }
LlamaParseParameters configuration
images_to_save: optional array of "screenshot" or "embedded" or "layout"
Enum for representing the mode of parsing to be used.
priority: optional "low" or "medium" or "high" or "critical"
The priority for the request. This field may be ignored or overwritten depending on the organization tier.
Enum for representing the different available page error handling modes.
webhook_configurations: optional array of WebhookConfiguration { webhook_events, webhook_headers, webhook_output_format, webhook_url }
The outbound webhook configurations
webhook_events: optional array of "extract.pending" or "extract.success" or "extract.error" or 13 more
List of event names to subscribe to
webhook_headers: optional map[string]
Custom HTTP headers to include with webhook requests.
webhook_output_format: optional string
The output format to use for the webhook. Defaults to string if none supplied. Currently supported values: string, json
webhook_url: optional string
The URL to send webhook notifications to.
source_id: string
ID of the source
source_type: string
Type of the source (e.g., 'project')
updated_at: string
Last update timestamp
version: string
Version of the configuration
creator: optional string
Creator of the configuration
ParseConfigurationCreate = object { name, parameters, version, 3 more }
Schema for creating a new parse configuration (API boundary).
name: string
Name of the parse configuration
parameters: LlamaParseParameters { adaptive_long_table, aggressive_table_extraction, annotate_links, 115 more }
LlamaParseParameters configuration
images_to_save: optional array of "screenshot" or "embedded" or "layout"
Enum for representing the mode of parsing to be used.
priority: optional "low" or "medium" or "high" or "critical"
The priority for the request. This field may be ignored or overwritten depending on the organization tier.
Enum for representing the different available page error handling modes.
webhook_configurations: optional array of WebhookConfiguration { webhook_events, webhook_headers, webhook_output_format, webhook_url }
The outbound webhook configurations
webhook_events: optional array of "extract.pending" or "extract.success" or "extract.error" or 13 more
List of event names to subscribe to
webhook_headers: optional map[string]
Custom HTTP headers to include with webhook requests.
webhook_output_format: optional string
The output format to use for the webhook. Defaults to string if none supplied. Currently supported values: string, json
webhook_url: optional string
The URL to send webhook notifications to.
version: string
Version of the configuration
creator: optional string
Creator of the configuration
source_id: optional string
ID of the source
source_type: optional string
Type of the source (e.g., 'project')
ParseConfigurationQueryResponse = object { items, next_page_token, total_size }
Response schema for paginated parse configuration queries.
The list of items.
id: string
Unique identifier for the parse configuration
created_at: string
Creation timestamp
name: string
Name of the parse configuration
parameters: LlamaParseParameters { adaptive_long_table, aggressive_table_extraction, annotate_links, 115 more }
LlamaParseParameters configuration
images_to_save: optional array of "screenshot" or "embedded" or "layout"
Enum for representing the mode of parsing to be used.
priority: optional "low" or "medium" or "high" or "critical"
The priority for the request. This field may be ignored or overwritten depending on the organization tier.
Enum for representing the different available page error handling modes.
webhook_configurations: optional array of WebhookConfiguration { webhook_events, webhook_headers, webhook_output_format, webhook_url }
The outbound webhook configurations
webhook_events: optional array of "extract.pending" or "extract.success" or "extract.error" or 13 more
List of event names to subscribe to
webhook_headers: optional map[string]
Custom HTTP headers to include with webhook requests.
webhook_output_format: optional string
The output format to use for the webhook. Defaults to string if none supplied. Currently supported values: string, json
webhook_url: optional string
The URL to send webhook notifications to.
source_id: string
ID of the source
source_type: string
Type of the source (e.g., 'project')
updated_at: string
Last update timestamp
version: string
Version of the configuration
creator: optional string
Creator of the configuration
next_page_token: optional string
A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no subsequent pages.
total_size: optional number
The total number of items available. This is only populated when specifically requested. The value may be an estimate and can be used for display purposes only.
BetaSheets
Create Spreadsheet Job
List Spreadsheet Jobs
Get Spreadsheet Job
Get Result Region
Delete Spreadsheet Job
ModelsExpand Collapse
SheetsJob = object { id, config, created_at, 10 more }
A spreadsheet parsing job
id: string
The ID of the job
config: SheetsParsingConfig { extraction_range, flatten_hierarchical_tables, generate_additional_metadata, 4 more }
Configuration for the parsing job
extraction_range: optional string
A1 notation of the range to extract a single region from. If None, the entire sheet is used.
flatten_hierarchical_tables: optional boolean
Return a flattened dataframe when a detected table is recognized as hierarchical.
generate_additional_metadata: optional boolean
Whether to generate additional metadata (title, description) for each extracted region.
include_hidden_cells: optional boolean
Whether to include hidden cells when extracting regions from the spreadsheet.
sheet_names: optional array of string
The names of the sheets to extract regions from. If empty, all sheets will be processed.
table_merge_sensitivity: optional "strong" or "weak"
Influences how likely similar-looking regions are merged into a single table. Useful for spreadsheets that either have sparse tables (strong merging) or many distinct tables close together (weak merging).
use_experimental_processing: optional boolean
Enables experimental processing. Accuracy may be impacted.
created_at: string
When the job was created
file_id: string
The ID of the input file
project_id: string
The ID of the project
The status of the parsing job
updated_at: string
When the job was last updated
user_id: string
The ID of the user
errors: optional array of string
Any errors encountered
Schema for a file.
id: string
Unique identifier
project_id: string
The ID of the project that the file belongs to
created_at: optional string
Creation datetime
data_source_id: optional string
The ID of the data source that the file belongs to
expires_at: optional string
The expiration date for the file. Files past this date can be deleted.
external_file_id: optional string
The ID of the file in the external system
file_size: optional number
Size of the file in bytes
file_type: optional string
File type (e.g. pdf, docx, etc.)
last_modified_at: optional string
The last modified time of the file
permission_info: optional map[map[unknown] or array of unknown or string or 2 more]
Permission information for the file
purpose: optional string
The intended purpose of the file (e.g., 'user_data', 'parse', 'extract', 'split', 'classify')
resource_info: optional map[map[unknown] or array of unknown or string or 2 more]
Resource information for the file
updated_at: optional string
Update datetime
regions: optional array of object { location, region_type, sheet_name, 3 more }
All extracted regions (populated when job is complete)
location: string
Location of the region in the spreadsheet
region_type: string
Type of the extracted region
sheet_name: string
Worksheet name where region was found
description: optional string
Generated description for the region
region_id: optional string
Unique identifier for this region within the file
title: optional string
Generated title for the region
success: optional boolean
Whether the job completed successfully
worksheet_metadata: optional array of object { sheet_name, description, title }
Metadata for each processed worksheet (populated when job is complete)
sheet_name: string
Name of the worksheet
description: optional string
Generated description of the worksheet
title: optional string
Generated title for the worksheet
SheetsParsingConfig = object { extraction_range, flatten_hierarchical_tables, generate_additional_metadata, 4 more }
Configuration for spreadsheet parsing and region extraction
extraction_range: optional string
A1 notation of the range to extract a single region from. If None, the entire sheet is used.
flatten_hierarchical_tables: optional boolean
Return a flattened dataframe when a detected table is recognized as hierarchical.
generate_additional_metadata: optional boolean
Whether to generate additional metadata (title, description) for each extracted region.
include_hidden_cells: optional boolean
Whether to include hidden cells when extracting regions from the spreadsheet.
sheet_names: optional array of string
The names of the sheets to extract regions from. If empty, all sheets will be processed.
table_merge_sensitivity: optional "strong" or "weak"
Influences how likely similar-looking regions are merged into a single table. Useful for spreadsheets that either have sparse tables (strong merging) or many distinct tables close together (weak merging).
use_experimental_processing: optional boolean
Enables experimental processing. Accuracy may be impacted.
BetaDirectories
Create Directory
List Directories
Get Directory
Update Directory
Delete Directory
BetaDirectoriesFiles
Add Directory File
List Directory Files
Get Directory File
Update Directory File
Delete Directory File
Upload File To Directory
BetaBatch
Create Batch Job
List Batch Jobs
Get Batch Job Status
Cancel Batch Job
BetaBatchJob Items
List Batch Job Items
Get Item Processing Results
BetaSplit
Create Split Job
List Split Jobs
Get Split Job
ModelsExpand Collapse
SplitCategory = object { name, description }
Category definition for document splitting.
name: string
Name of the category.
description: optional string
Optional description of what content belongs in this category.
SplitDocumentInput = object { type, value }
Document input specification.
type: string
Type of document input. Valid values are: file_id
value: string
Document identifier.
SplitResultResponse = object { segments }
Result of a completed split job.
List of document segments.
category: string
Category name this split belongs to.
confidence_category: string
Categorical confidence level. Valid values are: high, medium, low.
pages: array of number
1-indexed page numbers in this split.
SplitSegmentResponse = object { category, confidence_category, pages }
A segment of the split document.
category: string
Category name this split belongs to.
confidence_category: string
Categorical confidence level. Valid values are: high, medium, low.
pages: array of number
1-indexed page numbers in this split.