## Get Extract Job `$ llamacloud-prod extract get` **get** `/api/v2/extract/{job_id}` Get a single extraction job by ID. Returns the job status and results when complete. Use `expand=configuration` to include the full configuration used, and `expand=extract_metadata` for per-field metadata. ### Parameters - `--job-id: string` - `--expand: optional array of string` Additional fields to include: configuration, extract_metadata - `--organization-id: optional string` - `--project-id: optional string` ### Returns - `extract_v2_job: object { id, created_at, file_input, 9 more }` An extraction job. - `id: string` Unique job identifier (job_id) - `created_at: string` Creation timestamp - `file_input: string` File ID or parse job ID that was extracted - `project_id: string` Project this job belongs to - `status: string` Current job status. - `PENDING` — queued, not yet started - `RUNNING` — actively processing - `COMPLETED` — finished successfully - `FAILED` — terminated with an error - `CANCELLED` — cancelled by user - `updated_at: string` Last update timestamp - `configuration: optional object { data_schema, cite_sources, confidence_scores, 8 more }` Extract configuration combining parse and extract settings. - `data_schema: map[map[unknown] or array of unknown or string or 2 more]` JSON Schema defining the fields to extract. Validate with the /schema/validate endpoint first. - `union_member_0: map[unknown]` - `union_member_1: array of unknown` - `union_member_2: string` - `union_member_3: number` - `union_member_4: boolean` - `cite_sources: optional boolean` Include citations in results - `confidence_scores: optional boolean` Include confidence scores in results - `extract_version: optional string` Extract algorithm version. Use 'latest' for the default pipeline or a date string (e.g. '2026-01-08') to pin to a specific release. - `extraction_target: optional "per_doc" or "per_page" or "per_table_row"` Granularity of extraction: per_doc returns one object per document, per_page returns one object per page, per_table_row returns one object per table row - `"per_doc"` - `"per_page"` - `"per_table_row"` - `max_pages: optional number` Maximum number of pages to process. Omit for no limit. - `parse_config_id: optional string` Saved parse configuration ID to control how the document is parsed before extraction - `parse_tier: optional string` Parse tier to use before extraction. Defaults to the extract tier if not specified. - `system_prompt: optional string` Custom system prompt to guide extraction behavior - `target_pages: optional string` Comma-separated page numbers or ranges to process (1-based). Omit to process all pages. - `tier: optional "cost_effective" or "agentic"` Extract tier: cost_effective (5 credits/page) or agentic (15 credits/page) - `"cost_effective"` - `"agentic"` - `configuration_id: optional string` Saved extract configuration ID used for this job, if any - `error_message: optional string` Error details when status is FAILED - `extract_metadata: optional object { field_metadata, parse_job_id, parse_tier }` Extraction metadata. - `field_metadata: optional object { document_metadata, page_metadata, row_metadata }` Metadata for extracted fields including document, page, and row level info. - `document_metadata: optional map[map[unknown] or array of unknown or string or 2 more]` Per-field metadata keyed by field name from your schema. Scalar fields (e.g. `vendor`) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. `items`) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively. - `union_member_0: map[unknown]` - `union_member_1: array of unknown` - `union_member_2: string` - `union_member_3: number` - `union_member_4: boolean` - `page_metadata: optional array of map[map[unknown] or array of unknown or string or 2 more]` Per-page metadata when extraction_target is per_page - `union_member_0: map[unknown]` - `union_member_1: array of unknown` - `union_member_2: string` - `union_member_3: number` - `union_member_4: boolean` - `row_metadata: optional array of map[map[unknown] or array of unknown or string or 2 more]` Per-row metadata when extraction_target is per_table_row - `union_member_0: map[unknown]` - `union_member_1: array of unknown` - `union_member_2: string` - `union_member_3: number` - `union_member_4: boolean` - `parse_job_id: optional string` Reference to the ParseJob ID used for parsing - `parse_tier: optional string` Parse tier used for parsing the document - `extract_result: optional map[map[unknown] or array of unknown or string or 2 more] or array of map[map[unknown] or array of unknown or string or 2 more]` Extracted data conforming to the data_schema. Returns a single object for per_doc, or an array for per_page / per_table_row. - `union_member_0: map[map[unknown] or array of unknown or string or 2 more]` - `union_member_0: map[unknown]` - `union_member_1: array of unknown` - `union_member_2: string` - `union_member_3: number` - `union_member_4: boolean` - `union_member_1: array of map[map[unknown] or array of unknown or string or 2 more]` - `union_member_0: map[unknown]` - `union_member_1: array of unknown` - `union_member_2: string` - `union_member_3: number` - `union_member_4: boolean` - `metadata: optional object { usage }` Job-level metadata. - `usage: optional object { num_document_tokens, num_output_tokens, num_pages_extracted }` Extraction usage metrics. - `num_document_tokens: optional number` Number of document tokens - `num_output_tokens: optional number` Number of output tokens - `num_pages_extracted: optional number` Number of pages extracted ### Example ```cli llamacloud-prod extract get \ --api-key 'My API Key' \ --job-id job_id ``` #### Response ```json { "id": "ext-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "created_at": "2019-12-27T18:11:19.117Z", "file_input": "dfl-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "project_id": "prj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "status": "COMPLETED", "updated_at": "2019-12-27T18:11:19.117Z", "configuration": { "data_schema": { "foo": { "foo": "bar" } }, "cite_sources": true, "confidence_scores": true, "extract_version": "latest", "extraction_target": "per_doc", "max_pages": 10, "parse_config_id": "cfg-11111111-2222-3333-4444-555555555555", "parse_tier": "fast", "system_prompt": "Extract all monetary values in USD. If a currency is not specified, assume USD.", "target_pages": "1,3,5-7", "tier": "cost_effective" }, "configuration_id": "cfg-11111111-2222-3333-4444-555555555555", "error_message": "error_message", "extract_metadata": { "field_metadata": { "document_metadata": { "items": [ { "amount": { "citation": [ { "matching_text": "$10.00", "page": 1 } ], "confidence": 1 }, "description": { "citation": [ { "matching_text": "$10/month", "page": 1 } ], "confidence": 0.998 } } ], "total": { "citation": "bar", "confidence": "bar" }, "vendor": { "citation": "bar", "confidence": "bar", "extraction_confidence": "bar", "parsing_confidence": "bar" } }, "page_metadata": [ { "foo": { "foo": "bar" } } ], "row_metadata": [ { "foo": { "foo": "bar" } } ] }, "parse_job_id": "parse_job_id", "parse_tier": "parse_tier" }, "extract_result": { "foo": { "foo": "bar" } }, "metadata": { "usage": { "num_document_tokens": 0, "num_output_tokens": 0, "num_pages_extracted": 0 } } } ```