## Get Extract Job

`ExtractV2Job extract().get(ExtractGetParams params = ExtractGetParams.none(), RequestOptions requestOptions = RequestOptions.none())`

**get** `/api/v2/extract/{job_id}`

Get a single extraction job by ID.

Returns the job status and, once the job is complete, its results. Use `expand=configuration` to include the full configuration used, and `expand=extract_metadata` for per-field metadata.

### Parameters

- `ExtractGetParams params`

  - `Optional jobId`

  - `Optional> expand`

    Additional fields to include: configuration, extract_metadata

  - `Optional organizationId`

  - `Optional projectId`

### Returns

- `class ExtractV2Job:`

  An extraction job.

  - `String id`

    Unique job identifier (job_id)

  - `LocalDateTime createdAt`

    Creation timestamp

  - `String fileInput`

    File ID or parse job ID that was extracted

  - `String projectId`

    Project this job belongs to

  - `String status`

    Current job status.

    - `PENDING`: queued, not yet started
    - `RUNNING`: actively processing
    - `COMPLETED`: finished successfully
    - `FAILED`: terminated with an error
    - `CANCELLED`: cancelled by user

  - `LocalDateTime updatedAt`

    Last update timestamp

  - `Optional configuration`

    Extract configuration combining parse and extract settings.

    - `DataSchema dataSchema`

      JSON Schema defining the fields to extract. Validate with the /schema/validate endpoint first.

      - `class UnionMember0:`

      - `List`

      - `String`

      - `double`

      - `boolean`

    - `Optional citeSources`

      Include citations in results

    - `Optional confidenceScores`

      Include confidence scores in results

    - `Optional extractionTarget`

      Granularity of extraction: per_doc returns one object per document, per_page returns one object per page, per_table_row returns one object per table row

      - `PER_DOC("per_doc")`

      - `PER_PAGE("per_page")`

      - `PER_TABLE_ROW("per_table_row")`

    - `Optional maxPages`

      Maximum number of pages to process. Omit for no limit.

    - `Optional parseConfigId`

      Saved parse configuration ID to control how the document is parsed before extraction

    - `Optional parseTier`

      Parse tier to use before extraction. Defaults to the extract tier if not specified.

    - `Optional systemPrompt`

      Custom system prompt to guide extraction behavior

    - `Optional targetPages`

      Comma-separated page numbers or ranges to process (1-based). Omit to process all pages.

    - `Optional tier`

      Extract tier: cost_effective (5 credits/page) or agentic (15 credits/page)

      - `COST_EFFECTIVE("cost_effective")`

      - `AGENTIC("agentic")`

    - `Optional version`

      Use 'latest' for the latest release for the selected tier or a date string (YYYY-MM-DD format) to pin to the nearest release at or before that date.

  - `Optional configurationId`

    Saved extract configuration ID used for this job, if any

  - `Optional errorMessage`

    Error details when status is FAILED

  - `Optional extractMetadata`

    Extraction metadata.

    - `Optional fieldMetadata`

      Metadata for extracted fields including document, page, and row level info.

      - `Optional documentMetadata`

        Per-field metadata keyed by field name from your schema. Scalar fields (e.g. `vendor`) map to a FieldMetadataEntry with citation and confidence. Array fields (e.g. `items`) map to a list where each element contains per-sub-field FieldMetadataEntry objects, indexed by array position. Nested objects contain sub-field entries recursively.

        - `class UnionMember0:`

        - `List`

        - `String`

        - `double`

        - `boolean`

      - `Optional> pageMetadata`

        Per-page metadata when extraction_target is per_page

        - `class UnionMember0:`

        - `List`

        - `String`

        - `double`

        - `boolean`

      - `Optional> rowMetadata`

        Per-row metadata when extraction_target is per_table_row

        - `class UnionMember0:`

        - `List`

        - `String`

        - `double`

        - `boolean`

    - `Optional parseJobId`

      Reference to the ParseJob ID used for parsing

    - `Optional parseTier`

      Parse tier used for parsing the document

  - `Optional extractResult`

    Extracted data conforming to the data_schema. Returns a single object for per_doc, or an array for per_page / per_table_row.

    - `class UnionMember0:`

      - `class InnerUnionMember0:`

      - `List`

      - `String`

      - `double`

      - `boolean`

    - `List`

      - `class UnionMember0:`

      - `List`

      - `String`

      - `double`

      - `boolean`

  - `Optional metadata`

    Job-level metadata.

    - `Optional usage`

      Extraction usage metrics.

      - `Optional numPagesExtracted`

        Number of pages extracted

### Example

```java
package com.llamacloud_prod.api.example;

import com.llamacloud_prod.api.client.LlamaCloudClient;
import com.llamacloud_prod.api.client.okhttp.LlamaCloudOkHttpClient;
import com.llamacloud_prod.api.models.extract.ExtractGetParams;
import com.llamacloud_prod.api.models.extract.ExtractV2Job;

public final class Main {
    private Main() {}

    public static void main(String[] args) {
        LlamaCloudClient client = LlamaCloudOkHttpClient.fromEnv();

        ExtractV2Job extractV2Job = client.extract().get("job_id");
    }
}
```

#### Response

```json
{
  "id": "ext-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
  "created_at": "2019-12-27T18:11:19.117Z",
  "file_input": "dfl-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
  "project_id": "prj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee",
  "status": "COMPLETED",
  "updated_at": "2019-12-27T18:11:19.117Z",
  "configuration": {
    "data_schema": {
      "foo": {
        "foo": "bar"
      }
    },
    "cite_sources": true,
    "confidence_scores": true,
    "extraction_target": "per_doc",
    "max_pages": 10,
    "parse_config_id": "cfg-11111111-2222-3333-4444-555555555555",
    "parse_tier": "fast",
    "system_prompt": "Extract all monetary values in USD. If a currency is not specified, assume USD.",
    "target_pages": "1,3,5-7",
    "tier": "cost_effective",
    "version": "latest"
  },
  "configuration_id": "cfg-11111111-2222-3333-4444-555555555555",
  "error_message": "error_message",
  "extract_metadata": {
    "field_metadata": {
      "document_metadata": {
        "items": [
          {
            "amount": {
              "citation": [
                {
                  "matching_text": "$10.00",
                  "page": 1
                }
              ],
              "confidence": 1
            },
            "description": {
              "citation": [
                {
                  "matching_text": "$10/month",
                  "page": 1
                }
              ],
              "confidence": 0.998
            }
          }
        ],
        "total": {
          "citation": "bar",
          "confidence": "bar"
        },
        "vendor": {
          "citation": "bar",
          "confidence": "bar",
          "extraction_confidence": "bar",
          "parsing_confidence": "bar"
        }
      },
      "page_metadata": [
        {
          "foo": {
            "foo": "bar"
          }
        }
      ],
      "row_metadata": [
        {
          "foo": {
            "foo": "bar"
          }
        }
      ]
    },
    "parse_job_id": "parse_job_id",
    "parse_tier": "parse_tier"
  },
  "extract_result": {
    "foo": {
      "foo": "bar"
    }
  },
  "metadata": {
    "usage": {
      "num_pages_extracted": 0
    }
  }
}
```
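#### Polling sketch

Because results are only returned once the job is complete, callers typically re-fetch the job until `status` reaches one of the terminal values listed under Returns (`COMPLETED`, `FAILED`, `CANCELLED`). The following is a minimal sketch of that loop, not part of the SDK: `ExtractJobPolling`, `isTerminal`, and `pollUntilTerminal` are hypothetical names, and the status source is passed in as a plain `Supplier<String>` so the example runs without network access. In real use the supplier would wrap a call such as `client.extract().get("job_id")` and return the job's status string.

```java
import java.time.Duration;
import java.util.Iterator;
import java.util.List;
import java.util.Set;
import java.util.function.Supplier;

// Hypothetical helper, not part of the SDK.
final class ExtractJobPolling {
    // Terminal statuses, taken from the status list in the Returns section.
    static final Set<String> TERMINAL = Set.of("COMPLETED", "FAILED", "CANCELLED");

    static boolean isTerminal(String status) {
        return TERMINAL.contains(status);
    }

    /**
     * Calls fetchStatus until it reports a terminal status or maxAttempts
     * fetches have been made, sleeping for delay between fetches.
     * Returns the last status seen, which may be non-terminal on timeout.
     */
    static String pollUntilTerminal(Supplier<String> fetchStatus, int maxAttempts, Duration delay) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        String status = fetchStatus.get();
        for (int attempt = 1; attempt < maxAttempts && !isTerminal(status); attempt++) {
            try {
                Thread.sleep(delay.toMillis());
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("Interrupted while waiting for the job", e);
            }
            status = fetchStatus.get();
        }
        return status;
    }

    public static void main(String[] args) {
        // Stand-in for repeated GET /api/v2/extract/{job_id} calls.
        Iterator<String> fake = List.of("PENDING", "RUNNING", "COMPLETED").iterator();
        String last = pollUntilTerminal(fake::next, 5, Duration.ofMillis(10));
        System.out.println(last); // prints COMPLETED
    }
}
```

A fixed delay keeps the sketch short; a production caller would likely prefer a backoff and should inspect `errorMessage` when the final status is `FAILED`.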
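#### Reading `target_pages` and estimating credits

The configuration describes `targetPages` as comma-separated 1-based page numbers or ranges, and lists per-page rates for each tier (5 credits for `cost_effective`, 15 for `agentic`). The sketch below shows one way to interpret such a value client-side. It rests on assumptions the reference does not state: ranges are taken to be inclusive, surrounding whitespace is tolerated, duplicate pages count once, and the credit estimate is simply pages times the tier rate. `TargetPages` and its methods are hypothetical names, not SDK APIs.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

// Hypothetical helper, not part of the SDK.
final class TargetPages {
    /** Expands a spec such as "1,3,5-7" into sorted, de-duplicated 1-based page numbers. */
    static List<Integer> expand(String spec) {
        TreeSet<Integer> pages = new TreeSet<>();
        for (String part : spec.split(",")) {
            String token = part.trim();
            if (token.isEmpty()) {
                throw new IllegalArgumentException("Empty entry in: " + spec);
            }
            int dash = token.indexOf('-');
            int first = Integer.parseInt((dash < 0 ? token : token.substring(0, dash)).trim());
            // Assumption: a range "a-b" includes both endpoints.
            int last = dash < 0 ? first : Integer.parseInt(token.substring(dash + 1).trim());
            if (first < 1 || last < first) {
                throw new IllegalArgumentException("Bad page or range: " + token);
            }
            for (int p = first; p <= last; p++) {
                pages.add(p);
            }
        }
        return new ArrayList<>(pages);
    }

    /** Per-page rates as listed in the tier description. */
    static int creditsPerPage(String tier) {
        switch (tier) {
            case "cost_effective": return 5;
            case "agentic": return 15;
            default: throw new IllegalArgumentException("Unknown tier: " + tier);
        }
    }

    /** Assumption: cost is the number of selected pages times the tier rate. */
    static int estimateCredits(String spec, String tier) {
        return expand(spec).size() * creditsPerPage(tier);
    }

    public static void main(String[] args) {
        System.out.println(expand("1,3,5-7"));                            // [1, 3, 5, 6, 7]
        System.out.println(estimateCredits("1,3,5-7", "cost_effective")); // 25
        System.out.println(estimateCredits("1,3,5-7", "agentic"));        // 75
    }
}
```

For the sample value `"1,3,5-7"` this selects five pages, so the estimate is 5 × 5 = 25 credits on `cost_effective` and 5 × 15 = 75 on `agentic`. The `numPagesExtracted` field in the job's usage metadata is the figure reported by the service itself.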