## Get Parse Job `client.parsing.get(stringjobID, ParsingGetParamsquery?, RequestOptionsoptions?): ParsingGetResponse` **get** `/api/v2/parse/{job_id}` Retrieve a parse job with optional expanded content. By default returns job metadata only. Use `expand` to include parsed content: - `text` — plain text output - `markdown` — markdown output - `items` — structured page-by-page output - `job_metadata` — usage and processing details Content metadata fields (e.g. `text_content_metadata`) return presigned URLs for downloading large results. ### Parameters - `jobID: string` - `query: ParsingGetParams` - `expand?: Array` Fields to include: text, markdown, items, metadata, job_metadata, text_content_metadata, markdown_content_metadata, items_content_metadata, metadata_content_metadata, raw_words_content_metadata, xlsx_content_metadata, output_pdf_content_metadata, images_content_metadata. Metadata fields include presigned URLs. - `image_filenames?: string | null` Filter to specific image filenames (optional). Example: image_0.png,image_1.jpg - `organization_id?: string | null` - `project_id?: string | null` ### Returns - `ParsingGetResponse` Parse result response with job status and optional content or metadata. The job field is always included. Other fields are included based on expand parameters. - `job: Job` Parse job status and metadata - `id: string` Unique parse job identifier - `project_id: string` Project this job belongs to - `status: "PENDING" | "RUNNING" | "COMPLETED" | 2 more` Current job status: PENDING, RUNNING, COMPLETED, FAILED, or CANCELLED - `"PENDING"` - `"RUNNING"` - `"COMPLETED"` - `"FAILED"` - `"CANCELLED"` - `created_at?: string | null` Creation datetime - `error_message?: string | null` Error details when status is FAILED - `name?: string | null` Optional display name for this parse job - `tier?: string | null` Parsing tier used for this job - `updated_at?: string | null` Update datetime - `images_content_metadata?: ImagesContentMetadata | null` Metadata for all extracted images. - `images: Array` List of image metadata with presigned URLs - `filename: string` Image filename (e.g., 'image_0.png') - `index: number` Index of the image in the extraction order - `bbox?: Bbox | null` Bounding box for an image on its page. - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `category?: "screenshot" | "embedded" | "layout" | null` Image category: 'screenshot' (full page), 'embedded' (images in document), or 'layout' (cropped from layout detection) - `"screenshot"` - `"embedded"` - `"layout"` - `content_type?: string | null` MIME type of the image - `presigned_url?: string | null` Presigned URL to download the image - `size_bytes?: number | null` Deprecated: always returns None. Will be removed in a future release. - `total_count: number` Total number of extracted images - `items?: Items | null` Structured JSON result (if requested) - `pages: Array` List of structured pages or failed page entries - `StructuredResultPage` - `items: Array` List of structured items on the page - `TextItem` - `md: string` Markdown representation preserving formatting - `value: string` Text content - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `type?: "text"` Text item type - `"text"` - `HeadingItem` - `level: number` Heading level (1-6) - `md: string` Markdown representation preserving formatting - `value: string` Heading text content - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `type?: "heading"` Heading item type - `"heading"` - `ListItem` - `items: Array` List of nested text or list items - `TextItem` - `ListItem` - `md: string` Markdown representation preserving formatting - `ordered: boolean` Whether the list is ordered or unordered - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `type?: "list"` List item type - `"list"` - `CodeItem` - `md: string` Markdown representation preserving formatting - `value: string` Code content - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `language?: string | null` Programming language identifier - `type?: "code"` Code block item type - `"code"` - `TableItem` - `csv: string` CSV representation of the table - `html: string` HTML representation of the table - `md: string` Markdown representation preserving formatting - `rows: Array>` Table data as array of arrays (string, number, or null) - `string` - `number` - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `merged_from_pages?: Array | null` List of page numbers with tables that were merged into this table (e.g., [1, 2, 3, 4]) - `merged_into_page?: number | null` Populated when merged into another table. Page number where the full merged table begins (used on empty tables). - `parse_concerns?: Array | null` Quality concerns detected during table extraction, indicating the table may have issues - `details: string` Human-readable details about the concern - `type: string` Type of parse concern (e.g. header_value_type_mismatch, inconsistent_row_cell_count) - `type?: "table"` Table item type - `"table"` - `ImageItem` - `caption: string` Image caption - `md: string` Markdown representation preserving formatting - `url: string` URL to the image - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `type?: "image"` Image item type - `"image"` - `LinkItem` - `md: string` Markdown representation preserving formatting - `text: string` Display text of the link - `url: string` URL of the link - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `type?: "link"` Link item type - `"link"` - `HeaderItem` - `items: Array` List of items within the header - `TextItem` - `HeadingItem` - `ListItem` - `CodeItem` - `TableItem` - `ImageItem` - `LinkItem` - `md: string` Markdown representation preserving formatting - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `type?: "header"` Page header container - `"header"` - `FooterItem` - `items: Array` List of items within the footer - `TextItem` - `HeadingItem` - `ListItem` - `CodeItem` - `TableItem` - `ImageItem` - `LinkItem` - `md: string` Markdown representation preserving formatting - `bbox?: Array | null` List of bounding boxes - `h: number` Height of the bounding box - `w: number` Width of the bounding box - `x: number` X coordinate of the bounding box - `y: number` Y coordinate of the bounding box - `confidence?: number | null` Confidence score - `end_index?: number | null` End index in the text - `label?: string | null` Label for the bounding box - `start_index?: number | null` Start index in the text - `type?: "footer"` Page footer container - `"footer"` - `page_height: number` Height of the page in points - `page_number: number` Page number of the document - `page_width: number` Width of the page in points - `success: true` Success indicator - `true` - `FailedStructuredPage` - `error: string` Error message describing the failure - `page_number: number` Page number of the document - `success: false` Failure indicator - `false` - `job_metadata?: Record | null` Job execution metadata (if requested) - `markdown?: Markdown | null` Markdown result (if requested) - `pages: Array` List of markdown pages or failed page entries - `MarkdownResultPage` - `markdown: string` Markdown content of the page - `page_number: number` Page number of the document - `success: true` Success indicator - `true` - `footer?: string | null` Footer of the page in markdown - `header?: string | null` Header of the page in markdown - `FailedMarkdownPage` - `error: string` Error message describing the failure - `page_number: number` Page number of the document - `success: false` Failure indicator - `false` - `markdown_full?: string | null` Full raw markdown content (if requested) - `metadata?: Metadata | null` Result containing metadata (page level and general) for the parsed document. - `pages: Array` List of page metadata entries - `page_number: number` Page number of the document - `confidence?: number | null` Confidence score for the page parsing (0-1) - `cost_optimized?: boolean | null` Whether cost-optimized parsing was used for the page - `original_orientation_angle?: number | null` Original orientation angle of the page in degrees - `printed_page_number?: string | null` Printed page number as it appears in the document - `slide_section_name?: string | null` Section name from presentation slides - `speaker_notes?: string | null` Speaker notes from presentation slides - `triggered_auto_mode?: boolean | null` Whether auto mode was triggered for the page - `raw_parameters?: Record | null` - `result_content_metadata?: Record | null` Metadata including size, existence, and presigned URLs for result files - `size_bytes: number` Size of the result file in bytes - `exists?: boolean` Whether the result file exists in S3 - `presigned_url?: string | null` Presigned URL to download the result file - `text?: Text | null` Plain text result (if requested) - `pages: Array` List of text pages - `page_number: number` Page number of the document - `text: string` Plain text content of the page - `text_full?: string | null` Full raw text content (if requested) ### Example ```typescript import LlamaCloud from '@llamaindex/llama-cloud'; const client = new LlamaCloud({ apiKey: process.env['LLAMA_CLOUD_API_KEY'], // This is the default and can be omitted }); const parsing = await client.parsing.get('job_id'); console.log(parsing.job); ``` #### Response ```json { "job": { "id": "pjb-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "project_id": "prj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "status": "PENDING", "created_at": "2019-12-27T18:11:19.117Z", "error_message": "error_message", "name": "Q4 Financial Report", "tier": "fast", "updated_at": "2019-12-27T18:11:19.117Z" }, "images_content_metadata": { "images": [ { "filename": "filename", "index": 0, "bbox": { "h": 0, "w": 0, "x": 0, "y": 0 }, "category": "screenshot", "content_type": "content_type", "presigned_url": "presigned_url", "size_bytes": 0 } ], "total_count": 0 }, "items": { "pages": [ { "items": [ { "md": "md", "value": "value", "bbox": [ { "h": 0, "w": 0, "x": 0, "y": 0, "confidence": 0, "end_index": 0, "label": "label", "start_index": 0 } ], "type": "text" } ], "page_height": 0, "page_number": 0, "page_width": 0, "success": true } ] }, "job_metadata": { "foo": "bar" }, "markdown": { "pages": [ { "markdown": "markdown", "page_number": 0, "success": true, "footer": "footer", "header": "header" } ] }, "markdown_full": "markdown_full", "metadata": { "pages": [ { "page_number": 0, "confidence": 0, "cost_optimized": true, "original_orientation_angle": 0, "printed_page_number": "printed_page_number", "slide_section_name": "slide_section_name", "speaker_notes": "speaker_notes", "triggered_auto_mode": true } ] }, "raw_parameters": { "foo": "bar" }, "result_content_metadata": { "foo": { "size_bytes": 0, "exists": true, "presigned_url": "presigned_url" } }, "text": { "pages": [ { "page_number": 0, "text": "text" } ] }, "text_full": "text_full" } ```