# Batch ## Create Batch Job `BatchCreateResponse beta().batch().create(BatchCreateParamsparams, RequestOptionsrequestOptions = RequestOptions.none())` **post** `/api/v1/beta/batch-processing` Create a batch processing job. Processes files from a directory or a specific list of item IDs. Supports batch parsing and classification operations. Provide either `directory_id` to process all files in a directory, or `item_ids` for specific items. The job runs asynchronously — poll `GET /batch/{job_id}` for progress. ### Parameters - `BatchCreateParams params` - `Optional organizationId` - `Optional projectId` - `Optional temporalNamespace` - `JobConfig jobConfig` Job configuration — either a parse or classify config - `class BatchParseJobRecordCreate:` Batch-specific parse job record for batch processing. This model contains the metadata and configuration for a batch parse job, but excludes file-specific information. It's used as input to the batch parent workflow and combined with DirectoryFile data to create full ParseJobRecordCreate instances for each file. Attributes: job_name: Must be PARSE_RAW_FILE partitions: Partitions for job output location parameters: Generic parse configuration (BatchParseJobConfig) session_id: Upstream request ID for tracking correlation_id: Correlation ID for cross-service tracking parent_job_execution_id: Parent job execution ID if nested user_id: User who created the job project_id: Project this job belongs to webhook_url: Optional webhook URL for job completion notifications - `Optional correlationId` The correlation ID for this job. Used for tracking the job across services. - `Optional jobName` - `PARSE_RAW_FILE_JOB("parse_raw_file_job")` - `Optional parameters` Generic parse job configuration for batch processing. This model contains the parsing configuration that applies to all files in a batch, but excludes file-specific fields like file_name, file_id, etc. Those file-specific fields are populated from DirectoryFile data when creating individual ParseJobRecordCreate instances for each file. The fields in this model should be generic settings that apply uniformly to all files being processed in the batch. - `Optional adaptiveLongTable` - `Optional aggressiveTableExtraction` - `Optional annotateLinks` - `Optional autoMode` - `Optional autoModeConfigurationJson` - `Optional autoModeTriggerOnImageInPage` - `Optional autoModeTriggerOnRegexpInPage` - `Optional autoModeTriggerOnTableInPage` - `Optional autoModeTriggerOnTextInPage` - `Optional azureOpenAIApiVersion` - `Optional azureOpenAIDeploymentName` - `Optional azureOpenAIEndpoint` - `Optional azureOpenAIKey` - `Optional bboxBottom` - `Optional bboxLeft` - `Optional bboxRight` - `Optional bboxTop` - `Optional boundingBox` - `Optional compactMarkdownTable` - `Optional complementalFormattingInstruction` - `Optional contentGuidelineInstruction` - `Optional continuousMode` - `Optional customMetadata` The custom metadata to attach to the documents. - `Optional disableImageExtraction` - `Optional disableOcr` - `Optional disableReconstruction` - `Optional doNotCache` - `Optional doNotUnrollColumns` - `Optional enableCostOptimizer` - `Optional extractCharts` - `Optional extractLayout` - `Optional extractPrintedPageNumber` - `Optional fastMode` - `Optional formattingInstruction` - `Optional gpt4oApiKey` - `Optional gpt4oMode` - `Optional guessXlsxSheetName` - `Optional hideFooters` - `Optional hideHeaders` - `Optional highResOcr` - `Optional htmlMakeAllElementsVisible` - `Optional htmlRemoveFixedElements` - `Optional htmlRemoveNavigationElements` - `Optional httpProxy` - `Optional ignoreDocumentElementsForLayoutDetection` - `Optional> imagesToSave` - `SCREENSHOT("screenshot")` - `EMBEDDED("embedded")` - `LAYOUT("layout")` - `Optional inlineImagesInMarkdown` - `Optional inputS3Path` - `Optional inputS3Region` The region for the input S3 bucket. - `Optional inputUrl` - `Optional internalIsScreenshotJob` - `Optional invalidateCache` - `Optional isFormattingInstruction` - `Optional jobTimeoutExtraTimePerPageInSeconds` - `Optional jobTimeoutInSeconds` - `Optional keepPageSeparatorWhenMergingTables` - `Optional lang` The language. - `Optional> languages` - `AF("af")` - `AZ("az")` - `BS("bs")` - `CS("cs")` - `CY("cy")` - `DA("da")` - `DE("de")` - `EN("en")` - `ES("es")` - `ET("et")` - `FR("fr")` - `GA("ga")` - `HR("hr")` - `HU("hu")` - `ID("id")` - `IS("is")` - `IT("it")` - `KU("ku")` - `LA("la")` - `LT("lt")` - `LV("lv")` - `MI("mi")` - `MS("ms")` - `MT("mt")` - `NL("nl")` - `NO("no")` - `OC("oc")` - `PI("pi")` - `PL("pl")` - `PT("pt")` - `RO("ro")` - `RS_LATIN("rs_latin")` - `SK("sk")` - `SL("sl")` - `SQ("sq")` - `SV("sv")` - `SW("sw")` - `TL("tl")` - `TR("tr")` - `UZ("uz")` - `VI("vi")` - `AR("ar")` - `FA("fa")` - `UG("ug")` - `UR("ur")` - `BN("bn")` - `AS("as")` - `MNI("mni")` - `RU("ru")` - `RS_CYRILLIC("rs_cyrillic")` - `BE("be")` - `BG("bg")` - `UK("uk")` - `MN("mn")` - `ABQ("abq")` - `ADY("ady")` - `KBD("kbd")` - `AVA("ava")` - `DAR("dar")` - `INH("inh")` - `CHE("che")` - `LBE("lbe")` - `LEZ("lez")` - `TAB("tab")` - `TJK("tjk")` - `HI("hi")` - `MR("mr")` - `NE("ne")` - `BH("bh")` - `MAI("mai")` - `ANG("ang")` - `BHO("bho")` - `MAH("mah")` - `SCK("sck")` - `NEW("new")` - `GOM("gom")` - `SA("sa")` - `BGC("bgc")` - `TH("th")` - `CH_SIM("ch_sim")` - `CH_TRA("ch_tra")` - `JA("ja")` - `KO("ko")` - `TA("ta")` - `TE("te")` - `KN("kn")` - `Optional layoutAware` - `Optional lineLevelBoundingBox` - `Optional markdownTableMultilineHeaderSeparator` - `Optional maxPages` - `Optional maxPagesEnforced` - `Optional mergeTablesAcrossPagesInMarkdown` - `Optional model` - `Optional outlinedTableExtraction` - `Optional outputPdfOfDocument` - `Optional outputS3PathPrefix` If specified, llamaParse will save the output to the specified path. All output file will use this 'prefix' should be a valid s3:// url - `Optional outputS3Region` The region for the output S3 bucket. - `Optional outputTablesAsHtml` - `Optional outputBucket` The output bucket. - `Optional pageErrorTolerance` - `Optional pageFooterPrefix` - `Optional pageFooterSuffix` - `Optional pageHeaderPrefix` - `Optional pageHeaderSuffix` - `Optional pagePrefix` - `Optional pageSeparator` - `Optional pageSuffix` - `Optional parseMode` Enum for representing the mode of parsing to be used. - `PARSE_PAGE_WITHOUT_LLM("parse_page_without_llm")` - `PARSE_PAGE_WITH_LLM("parse_page_with_llm")` - `PARSE_PAGE_WITH_LVM("parse_page_with_lvm")` - `PARSE_PAGE_WITH_AGENT("parse_page_with_agent")` - `PARSE_PAGE_WITH_LAYOUT_AGENT("parse_page_with_layout_agent")` - `PARSE_DOCUMENT_WITH_LLM("parse_document_with_llm")` - `PARSE_DOCUMENT_WITH_LVM("parse_document_with_lvm")` - `PARSE_DOCUMENT_WITH_AGENT("parse_document_with_agent")` - `Optional parsingInstruction` - `Optional pipelineId` The pipeline ID. - `Optional preciseBoundingBox` - `Optional premiumMode` - `Optional presentationOutOfBoundsContent` - `Optional presentationSkipEmbeddedData` - `Optional preserveLayoutAlignmentAcrossPages` - `Optional preserveVerySmallText` - `Optional preset` - `Optional priority` The priority for the request. This field may be ignored or overwritten depending on the organization tier. - `LOW("low")` - `MEDIUM("medium")` - `HIGH("high")` - `CRITICAL("critical")` - `Optional projectId` - `Optional removeHiddenText` - `Optional replaceFailedPageMode` Enum for representing the different available page error handling modes. - `RAW_TEXT("raw_text")` - `BLANK_PAGE("blank_page")` - `ERROR_MESSAGE("error_message")` - `Optional replaceFailedPageWithErrorMessagePrefix` - `Optional replaceFailedPageWithErrorMessageSuffix` - `Optional resourceInfo` The resource info about the file - `Optional saveImages` - `Optional skipDiagonalText` - `Optional specializedChartParsingAgentic` - `Optional specializedChartParsingEfficient` - `Optional specializedChartParsingPlus` - `Optional specializedImageParsing` - `Optional spreadsheetExtractSubTables` - `Optional spreadsheetForceFormulaComputation` - `Optional spreadsheetIncludeHiddenSheets` - `Optional strictModeBuggyFont` - `Optional strictModeImageExtraction` - `Optional strictModeImageOcr` - `Optional strictModeReconstruction` - `Optional structuredOutput` - `Optional structuredOutputJsonSchema` - `Optional structuredOutputJsonSchemaName` - `Optional systemPrompt` - `Optional systemPromptAppend` - `Optional takeScreenshot` - `Optional targetPages` - `Optional tier` - `Optional type` - `PARSE("parse")` - `Optional useVendorMultimodalModel` - `Optional userPrompt` - `Optional vendorMultimodalApiKey` - `Optional vendorMultimodalModelName` - `Optional version` - `Optional> webhookConfigurations` Outbound webhook endpoints to notify on job status changes - `Optional> webhookEvents` Events to subscribe to (e.g. 'parse.success', 'extract.error'). If null, all events are delivered. - `EXTRACT_PENDING("extract.pending")` - `EXTRACT_SUCCESS("extract.success")` - `EXTRACT_ERROR("extract.error")` - `EXTRACT_PARTIAL_SUCCESS("extract.partial_success")` - `EXTRACT_CANCELLED("extract.cancelled")` - `PARSE_PENDING("parse.pending")` - `PARSE_RUNNING("parse.running")` - `PARSE_SUCCESS("parse.success")` - `PARSE_ERROR("parse.error")` - `PARSE_PARTIAL_SUCCESS("parse.partial_success")` - `PARSE_CANCELLED("parse.cancelled")` - `CLASSIFY_PENDING("classify.pending")` - `CLASSIFY_RUNNING("classify.running")` - `CLASSIFY_SUCCESS("classify.success")` - `CLASSIFY_ERROR("classify.error")` - `CLASSIFY_PARTIAL_SUCCESS("classify.partial_success")` - `CLASSIFY_CANCELLED("classify.cancelled")` - `SHEETS_PENDING("sheets.pending")` - `SHEETS_SUCCESS("sheets.success")` - `SHEETS_ERROR("sheets.error")` - `SHEETS_PARTIAL_SUCCESS("sheets.partial_success")` - `SHEETS_CANCELLED("sheets.cancelled")` - `UNMAPPED_EVENT("unmapped_event")` - `Optional webhookHeaders` Custom HTTP headers sent with each webhook request (e.g. auth tokens) - `Optional webhookOutputFormat` Response format sent to the webhook: 'string' (default) or 'json' - `Optional webhookUrl` URL to receive webhook POST notifications - `Optional webhookUrl` - `Optional parentJobExecutionId` The ID of the parent job execution. - `Optional partitions` The partitions for this execution. Used for determining where to save job output. - `Optional projectId` The ID of the project this job belongs to. - `Optional sessionId` The upstream request ID that created this job. Used for tracking the job across services. - `Optional userId` The ID of the user that created this job - `Optional webhookUrl` The URL that needs to be called at the end of the parsing job. - `class ClassifyJob:` A classify job. - `String id` Unique identifier - `String projectId` The ID of the project - `List rules` The rules to classify the files - `String description` Natural language description of what to classify. Be specific about the content characteristics that identify this document type. - `String type` The document type to assign when this rule matches (e.g., 'invoice', 'receipt', 'contract') - `StatusEnum status` The status of the classify job - `PENDING("PENDING")` - `SUCCESS("SUCCESS")` - `ERROR("ERROR")` - `PARTIAL_SUCCESS("PARTIAL_SUCCESS")` - `CANCELLED("CANCELLED")` - `String userId` The ID of the user - `Optional createdAt` Creation datetime - `Optional effectiveAt` - `Optional errorMessage` Error message for the latest job attempt, if any. - `Optional jobRecordId` The job record ID associated with this status, if any. - `Optional mode` The classification mode to use - `FAST("FAST")` - `MULTIMODAL("MULTIMODAL")` - `Optional parsingConfiguration` The configuration for the parsing job - `Optional lang` The language to parse the files in - `AF("af")` - `AZ("az")` - `BS("bs")` - `CS("cs")` - `CY("cy")` - `DA("da")` - `DE("de")` - `EN("en")` - `ES("es")` - `ET("et")` - `FR("fr")` - `GA("ga")` - `HR("hr")` - `HU("hu")` - `ID("id")` - `IS("is")` - `IT("it")` - `KU("ku")` - `LA("la")` - `LT("lt")` - `LV("lv")` - `MI("mi")` - `MS("ms")` - `MT("mt")` - `NL("nl")` - `NO("no")` - `OC("oc")` - `PI("pi")` - `PL("pl")` - `PT("pt")` - `RO("ro")` - `RS_LATIN("rs_latin")` - `SK("sk")` - `SL("sl")` - `SQ("sq")` - `SV("sv")` - `SW("sw")` - `TL("tl")` - `TR("tr")` - `UZ("uz")` - `VI("vi")` - `AR("ar")` - `FA("fa")` - `UG("ug")` - `UR("ur")` - `BN("bn")` - `AS("as")` - `MNI("mni")` - `RU("ru")` - `RS_CYRILLIC("rs_cyrillic")` - `BE("be")` - `BG("bg")` - `UK("uk")` - `MN("mn")` - `ABQ("abq")` - `ADY("ady")` - `KBD("kbd")` - `AVA("ava")` - `DAR("dar")` - `INH("inh")` - `CHE("che")` - `LBE("lbe")` - `LEZ("lez")` - `TAB("tab")` - `TJK("tjk")` - `HI("hi")` - `MR("mr")` - `NE("ne")` - `BH("bh")` - `MAI("mai")` - `ANG("ang")` - `BHO("bho")` - `MAH("mah")` - `SCK("sck")` - `NEW("new")` - `GOM("gom")` - `SA("sa")` - `BGC("bgc")` - `TH("th")` - `CH_SIM("ch_sim")` - `CH_TRA("ch_tra")` - `JA("ja")` - `KO("ko")` - `TA("ta")` - `TE("te")` - `KN("kn")` - `Optional maxPages` The maximum number of pages to parse - `Optional> targetPages` The pages to target for parsing (0-indexed, so first page is at 0) - `Optional updatedAt` Update datetime - `Optional continueAsNewThreshold` Maximum files to process per execution cycle in directory mode. Defaults to page_size. - `Optional directoryId` ID of the directory containing files to process - `Optional> itemIds` List of specific item IDs to process. Either this or directory_id must be provided. - `Optional pageSize` Number of files to process per batch when using directory mode ### Returns - `class BatchCreateResponse:` Response schema for a batch processing job. - `String id` Unique identifier for the batch job - `JobType jobType` Type of processing operation (parse or classify) - `PARSE("parse")` - `EXTRACT("extract")` - `CLASSIFY("classify")` - `String projectId` Project this job belongs to - `Status status` Current job status - `PENDING("pending")` - `RUNNING("running")` - `DISPATCHED("dispatched")` - `COMPLETED("completed")` - `FAILED("failed")` - `CANCELLED("cancelled")` - `long totalItems` Total number of items in the job - `Optional completedAt` Timestamp when job completed - `Optional createdAt` Creation datetime - `Optional directoryId` Directory being processed - `Optional effectiveAt` - `Optional errorMessage` Error message for the latest job attempt, if any. - `Optional failedItems` Number of items that failed processing - `Optional jobRecordId` The job record ID associated with this status, if any. - `Optional processedItems` Number of items processed so far - `Optional skippedItems` Number of items skipped (already processed or size limit) - `Optional startedAt` Timestamp when job processing started - `Optional updatedAt` Update datetime - `Optional workflowId` Async job tracking ID ### Example ```java package com.llamacloud_prod.api.example; import com.llamacloud_prod.api.client.LlamaCloudClient; import com.llamacloud_prod.api.client.okhttp.LlamaCloudOkHttpClient; import com.llamacloud_prod.api.models.beta.batch.BatchCreateParams; import com.llamacloud_prod.api.models.beta.batch.BatchCreateResponse; public final class Main { private Main() {} public static void main(String[] args) { LlamaCloudClient client = LlamaCloudOkHttpClient.fromEnv(); BatchCreateParams params = BatchCreateParams.builder() .jobConfig(BatchCreateParams.JobConfig.BatchParseJobRecordCreate.builder().build()) .build(); BatchCreateResponse batch = client.beta().batch().create(params); } } ``` #### Response ```json { "id": "bjb-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "job_type": "parse", "project_id": "proj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "status": "pending", "total_items": 0, "completed_at": "2019-12-27T18:11:19.117Z", "created_at": "2019-12-27T18:11:19.117Z", "directory_id": "dir-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "effective_at": "2019-12-27T18:11:19.117Z", "error_message": "error_message", "failed_items": 0, "job_record_id": "job_record_id", "processed_items": 0, "skipped_items": 0, "started_at": "2019-12-27T18:11:19.117Z", "updated_at": "2019-12-27T18:11:19.117Z", "workflow_id": "workflow_id" } ``` ## List Batch Jobs `BatchListPage beta().batch().list(BatchListParamsparams = BatchListParams.none(), RequestOptionsrequestOptions = RequestOptions.none())` **get** `/api/v1/beta/batch-processing` List batch processing jobs with optional filtering. Filter by `directory_id`, `job_type`, or `status`. Results are paginated with configurable `limit` and `offset`. ### Parameters - `BatchListParams params` - `Optional directoryId` Filter by directory ID - `Optional jobType` Filter by job type (PARSE, EXTRACT, CLASSIFY) - `PARSE("parse")` - `EXTRACT("extract")` - `CLASSIFY("classify")` - `Optional limit` Maximum number of jobs to return - `Optional offset` Number of jobs to skip for pagination - `Optional organizationId` - `Optional projectId` - `Optional status` Filter by job status (PENDING, RUNNING, COMPLETED, FAILED, CANCELLED) - `PENDING("pending")` - `RUNNING("running")` - `DISPATCHED("dispatched")` - `COMPLETED("completed")` - `FAILED("failed")` - `CANCELLED("cancelled")` ### Returns - `class BatchListResponse:` Response schema for a batch processing job. - `String id` Unique identifier for the batch job - `JobType jobType` Type of processing operation (parse or classify) - `PARSE("parse")` - `EXTRACT("extract")` - `CLASSIFY("classify")` - `String projectId` Project this job belongs to - `Status status` Current job status - `PENDING("pending")` - `RUNNING("running")` - `DISPATCHED("dispatched")` - `COMPLETED("completed")` - `FAILED("failed")` - `CANCELLED("cancelled")` - `long totalItems` Total number of items in the job - `Optional completedAt` Timestamp when job completed - `Optional createdAt` Creation datetime - `Optional directoryId` Directory being processed - `Optional effectiveAt` - `Optional errorMessage` Error message for the latest job attempt, if any. - `Optional failedItems` Number of items that failed processing - `Optional jobRecordId` The job record ID associated with this status, if any. - `Optional processedItems` Number of items processed so far - `Optional skippedItems` Number of items skipped (already processed or size limit) - `Optional startedAt` Timestamp when job processing started - `Optional updatedAt` Update datetime - `Optional workflowId` Async job tracking ID ### Example ```java package com.llamacloud_prod.api.example; import com.llamacloud_prod.api.client.LlamaCloudClient; import com.llamacloud_prod.api.client.okhttp.LlamaCloudOkHttpClient; import com.llamacloud_prod.api.models.beta.batch.BatchListPage; import com.llamacloud_prod.api.models.beta.batch.BatchListParams; public final class Main { private Main() {} public static void main(String[] args) { LlamaCloudClient client = LlamaCloudOkHttpClient.fromEnv(); BatchListPage page = client.beta().batch().list(); } } ``` #### Response ```json { "items": [ { "id": "bjb-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "job_type": "parse", "project_id": "proj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "status": "pending", "total_items": 0, "completed_at": "2019-12-27T18:11:19.117Z", "created_at": "2019-12-27T18:11:19.117Z", "directory_id": "dir-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "effective_at": "2019-12-27T18:11:19.117Z", "error_message": "error_message", "failed_items": 0, "job_record_id": "job_record_id", "processed_items": 0, "skipped_items": 0, "started_at": "2019-12-27T18:11:19.117Z", "updated_at": "2019-12-27T18:11:19.117Z", "workflow_id": "workflow_id" } ], "next_page_token": "next_page_token", "total_size": 0 } ``` ## Get Batch Job Status `BatchGetStatusResponse beta().batch().getStatus(BatchGetStatusParamsparams = BatchGetStatusParams.none(), RequestOptionsrequestOptions = RequestOptions.none())` **get** `/api/v1/beta/batch-processing/{job_id}` Get detailed status of a batch processing job. Returns current progress percentage, file counts (total, processed, failed, skipped), and timestamps. ### Parameters - `BatchGetStatusParams params` - `Optional jobId` - `Optional organizationId` - `Optional projectId` ### Returns - `class BatchGetStatusResponse:` Detailed status response for a batch processing job. - `Job job` Response schema for a batch processing job. - `String id` Unique identifier for the batch job - `JobType jobType` Type of processing operation (parse or classify) - `PARSE("parse")` - `EXTRACT("extract")` - `CLASSIFY("classify")` - `String projectId` Project this job belongs to - `Status status` Current job status - `PENDING("pending")` - `RUNNING("running")` - `DISPATCHED("dispatched")` - `COMPLETED("completed")` - `FAILED("failed")` - `CANCELLED("cancelled")` - `long totalItems` Total number of items in the job - `Optional completedAt` Timestamp when job completed - `Optional createdAt` Creation datetime - `Optional directoryId` Directory being processed - `Optional effectiveAt` - `Optional errorMessage` Error message for the latest job attempt, if any. - `Optional failedItems` Number of items that failed processing - `Optional jobRecordId` The job record ID associated with this status, if any. - `Optional processedItems` Number of items processed so far - `Optional skippedItems` Number of items skipped (already processed or size limit) - `Optional startedAt` Timestamp when job processing started - `Optional updatedAt` Update datetime - `Optional workflowId` Async job tracking ID - `double progressPercentage` Percentage of items processed (0-100) ### Example ```java package com.llamacloud_prod.api.example; import com.llamacloud_prod.api.client.LlamaCloudClient; import com.llamacloud_prod.api.client.okhttp.LlamaCloudOkHttpClient; import com.llamacloud_prod.api.models.beta.batch.BatchGetStatusParams; import com.llamacloud_prod.api.models.beta.batch.BatchGetStatusResponse; public final class Main { private Main() {} public static void main(String[] args) { LlamaCloudClient client = LlamaCloudOkHttpClient.fromEnv(); BatchGetStatusResponse response = client.beta().batch().getStatus("job_id"); } } ``` #### Response ```json { "job": { "id": "bjb-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "job_type": "parse", "project_id": "proj-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "status": "pending", "total_items": 0, "completed_at": "2019-12-27T18:11:19.117Z", "created_at": "2019-12-27T18:11:19.117Z", "directory_id": "dir-aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee", "effective_at": "2019-12-27T18:11:19.117Z", "error_message": "error_message", "failed_items": 0, "job_record_id": "job_record_id", "processed_items": 0, "skipped_items": 0, "started_at": "2019-12-27T18:11:19.117Z", "updated_at": "2019-12-27T18:11:19.117Z", "workflow_id": "workflow_id" }, "progress_percentage": 0 } ``` ## Cancel Batch Job `BatchCancelResponse beta().batch().cancel(BatchCancelParamsparams = BatchCancelParams.none(), RequestOptionsrequestOptions = RequestOptions.none())` **post** `/api/v1/beta/batch-processing/{job_id}/cancel` Cancel a running batch processing job. Stops processing and marks pending items as cancelled. Items currently being processed may still complete. ### Parameters - `BatchCancelParams params` - `Optional jobId` - `Optional organizationId` - `Optional projectId` - `Optional temporalNamespace` - `Optional reason` Optional reason for cancelling the job ### Returns - `class BatchCancelResponse:` Response after cancelling a batch job. - `String jobId` ID of the cancelled job - `String message` Confirmation message - `long processedItems` Number of items processed before cancellation - `Status status` New status (should be 'cancelled') - `PENDING("pending")` - `RUNNING("running")` - `DISPATCHED("dispatched")` - `COMPLETED("completed")` - `FAILED("failed")` - `CANCELLED("cancelled")` ### Example ```java package com.llamacloud_prod.api.example; import com.llamacloud_prod.api.client.LlamaCloudClient; import com.llamacloud_prod.api.client.okhttp.LlamaCloudOkHttpClient; import com.llamacloud_prod.api.models.beta.batch.BatchCancelParams; import com.llamacloud_prod.api.models.beta.batch.BatchCancelResponse; public final class Main { private Main() {} public static void main(String[] args) { LlamaCloudClient client = LlamaCloudOkHttpClient.fromEnv(); BatchCancelResponse response = client.beta().batch().cancel("job_id"); } } ``` #### Response ```json { "job_id": "job_id", "message": "message", "processed_items": 0, "status": "pending" } ``` # Job Items ## List Batch Job Items `JobItemListPage beta().batch().jobItems().list(JobItemListParamsparams = JobItemListParams.none(), RequestOptionsrequestOptions = RequestOptions.none())` **get** `/api/v1/beta/batch-processing/{job_id}/items` List items in a batch job with optional status filtering. Useful for finding failed items, viewing completed items, or debugging processing issues. ### Parameters - `JobItemListParams params` - `Optional jobId` - `Optional limit` Maximum number of items to return - `Optional offset` Number of items to skip - `Optional organizationId` - `Optional projectId` - `Optional status` Filter items by status - `PENDING("pending")` - `PROCESSING("processing")` - `COMPLETED("completed")` - `FAILED("failed")` - `SKIPPED("skipped")` - `CANCELLED("cancelled")` ### Returns - `class JobItemListResponse:` Detailed information about an item in a batch job. - `String itemId` ID of the item - `String itemName` Name of the item - `Status status` Processing status of this item - `PENDING("pending")` - `PROCESSING("processing")` - `COMPLETED("completed")` - `FAILED("failed")` - `SKIPPED("skipped")` - `CANCELLED("cancelled")` - `Optional completedAt` When processing completed for this item - `Optional effectiveAt` - `Optional errorMessage` Error message for the latest job attempt, if any. - `Optional jobId` Job ID for the underlying processing job (links to parse/extract job results) - `Optional jobRecordId` The job record ID associated with this status, if any. - `Optional skipReason` Reason item was skipped (e.g., 'already_processed', 'size_limit_exceeded') - `Optional startedAt` When processing started for this item ### Example ```java package com.llamacloud_prod.api.example; import com.llamacloud_prod.api.client.LlamaCloudClient; import com.llamacloud_prod.api.client.okhttp.LlamaCloudOkHttpClient; import com.llamacloud_prod.api.models.beta.batch.jobitems.JobItemListPage; import com.llamacloud_prod.api.models.beta.batch.jobitems.JobItemListParams; public final class Main { private Main() {} public static void main(String[] args) { LlamaCloudClient client = LlamaCloudOkHttpClient.fromEnv(); JobItemListPage page = client.beta().batch().jobItems().list("job_id"); } } ``` #### Response ```json { "items": [ { "item_id": "item_id", "item_name": "item_name", "status": "pending", "completed_at": "2019-12-27T18:11:19.117Z", "effective_at": "2019-12-27T18:11:19.117Z", "error_message": "error_message", "job_id": "job_id", "job_record_id": "job_record_id", "skip_reason": "skip_reason", "started_at": "2019-12-27T18:11:19.117Z" } ], "next_page_token": "next_page_token", "total_size": 0 } ``` ## Get Item Processing Results `JobItemGetProcessingResultsResponse beta().batch().jobItems().getProcessingResults(JobItemGetProcessingResultsParamsparams = JobItemGetProcessingResultsParams.none(), RequestOptionsrequestOptions = RequestOptions.none())` **get** `/api/v1/beta/batch-processing/items/{item_id}/processing-results` Get all processing results for a specific item. Returns the complete processing history for an item including what operations were performed, parameters used, and where outputs are stored. Optionally filter by `job_type`. ### Parameters - `JobItemGetProcessingResultsParams params` - `Optional itemId` - `Optional jobType` Filter results by job type - `PARSE("parse")` - `EXTRACT("extract")` - `CLASSIFY("classify")` - `Optional organizationId` - `Optional projectId` ### Returns - `class JobItemGetProcessingResultsResponse:` Response containing all processing results for an item. - `String itemId` ID of the source item - `String itemName` Name of the source item - `Optional> processingResults` List of all processing operations performed on this item - `String itemId` Source item that was processed - `JobConfig jobConfig` Job configuration used for processing - `class BatchParseJobRecordCreate:` Batch-specific parse job record for batch processing. This model contains the metadata and configuration for a batch parse job, but excludes file-specific information. It's used as input to the batch parent workflow and combined with DirectoryFile data to create full ParseJobRecordCreate instances for each file. Attributes: job_name: Must be PARSE_RAW_FILE partitions: Partitions for job output location parameters: Generic parse configuration (BatchParseJobConfig) session_id: Upstream request ID for tracking correlation_id: Correlation ID for cross-service tracking parent_job_execution_id: Parent job execution ID if nested user_id: User who created the job project_id: Project this job belongs to webhook_url: Optional webhook URL for job completion notifications - `Optional correlationId` The correlation ID for this job. Used for tracking the job across services. - `Optional jobName` - `PARSE_RAW_FILE_JOB("parse_raw_file_job")` - `Optional parameters` Generic parse job configuration for batch processing. This model contains the parsing configuration that applies to all files in a batch, but excludes file-specific fields like file_name, file_id, etc. Those file-specific fields are populated from DirectoryFile data when creating individual ParseJobRecordCreate instances for each file. The fields in this model should be generic settings that apply uniformly to all files being processed in the batch. - `Optional adaptiveLongTable` - `Optional aggressiveTableExtraction` - `Optional annotateLinks` - `Optional autoMode` - `Optional autoModeConfigurationJson` - `Optional autoModeTriggerOnImageInPage` - `Optional autoModeTriggerOnRegexpInPage` - `Optional autoModeTriggerOnTableInPage` - `Optional autoModeTriggerOnTextInPage` - `Optional azureOpenAIApiVersion` - `Optional azureOpenAIDeploymentName` - `Optional azureOpenAIEndpoint` - `Optional azureOpenAIKey` - `Optional bboxBottom` - `Optional bboxLeft` - `Optional bboxRight` - `Optional bboxTop` - `Optional boundingBox` - `Optional compactMarkdownTable` - `Optional complementalFormattingInstruction` - `Optional contentGuidelineInstruction` - `Optional continuousMode` - `Optional customMetadata` The custom metadata to attach to the documents. - `Optional disableImageExtraction` - `Optional disableOcr` - `Optional disableReconstruction` - `Optional doNotCache` - `Optional doNotUnrollColumns` - `Optional enableCostOptimizer` - `Optional extractCharts` - `Optional extractLayout` - `Optional extractPrintedPageNumber` - `Optional fastMode` - `Optional formattingInstruction` - `Optional gpt4oApiKey` - `Optional gpt4oMode` - `Optional guessXlsxSheetName` - `Optional hideFooters` - `Optional hideHeaders` - `Optional highResOcr` - `Optional htmlMakeAllElementsVisible` - `Optional htmlRemoveFixedElements` - `Optional htmlRemoveNavigationElements` - `Optional httpProxy` - `Optional ignoreDocumentElementsForLayoutDetection` - `Optional> imagesToSave` - `SCREENSHOT("screenshot")` - `EMBEDDED("embedded")` - `LAYOUT("layout")` - `Optional inlineImagesInMarkdown` - `Optional inputS3Path` - `Optional inputS3Region` The region for the input S3 bucket. - `Optional inputUrl` - `Optional internalIsScreenshotJob` - `Optional invalidateCache` - `Optional isFormattingInstruction` - `Optional jobTimeoutExtraTimePerPageInSeconds` - `Optional jobTimeoutInSeconds` - `Optional keepPageSeparatorWhenMergingTables` - `Optional lang` The language. - `Optional> languages` - `AF("af")` - `AZ("az")` - `BS("bs")` - `CS("cs")` - `CY("cy")` - `DA("da")` - `DE("de")` - `EN("en")` - `ES("es")` - `ET("et")` - `FR("fr")` - `GA("ga")` - `HR("hr")` - `HU("hu")` - `ID("id")` - `IS("is")` - `IT("it")` - `KU("ku")` - `LA("la")` - `LT("lt")` - `LV("lv")` - `MI("mi")` - `MS("ms")` - `MT("mt")` - `NL("nl")` - `NO("no")` - `OC("oc")` - `PI("pi")` - `PL("pl")` - `PT("pt")` - `RO("ro")` - `RS_LATIN("rs_latin")` - `SK("sk")` - `SL("sl")` - `SQ("sq")` - `SV("sv")` - `SW("sw")` - `TL("tl")` - `TR("tr")` - `UZ("uz")` - `VI("vi")` - `AR("ar")` - `FA("fa")` - `UG("ug")` - `UR("ur")` - `BN("bn")` - `AS("as")` - `MNI("mni")` - `RU("ru")` - `RS_CYRILLIC("rs_cyrillic")` - `BE("be")` - `BG("bg")` - `UK("uk")` - `MN("mn")` - `ABQ("abq")` - `ADY("ady")` - `KBD("kbd")` - `AVA("ava")` - `DAR("dar")` - `INH("inh")` - `CHE("che")` - `LBE("lbe")` - `LEZ("lez")` - `TAB("tab")` - `TJK("tjk")` - `HI("hi")` - `MR("mr")` - `NE("ne")` - `BH("bh")` - `MAI("mai")` - `ANG("ang")` - `BHO("bho")` - `MAH("mah")` - `SCK("sck")` - `NEW("new")` - `GOM("gom")` - `SA("sa")` - `BGC("bgc")` - `TH("th")` - `CH_SIM("ch_sim")` - `CH_TRA("ch_tra")` - `JA("ja")` - `KO("ko")` - `TA("ta")` - `TE("te")` - `KN("kn")` - `Optional layoutAware` - `Optional lineLevelBoundingBox` - `Optional markdownTableMultilineHeaderSeparator` - `Optional maxPages` - `Optional maxPagesEnforced` - `Optional mergeTablesAcrossPagesInMarkdown` - `Optional model` - `Optional outlinedTableExtraction` - `Optional outputPdfOfDocument` - `Optional outputS3PathPrefix` If specified, llamaParse will save the output to the specified path. All output file will use this 'prefix' should be a valid s3:// url - `Optional outputS3Region` The region for the output S3 bucket. - `Optional outputTablesAsHtml` - `Optional outputBucket` The output bucket. - `Optional pageErrorTolerance` - `Optional pageFooterPrefix` - `Optional pageFooterSuffix` - `Optional pageHeaderPrefix` - `Optional pageHeaderSuffix` - `Optional pagePrefix` - `Optional pageSeparator` - `Optional pageSuffix` - `Optional parseMode` Enum for representing the mode of parsing to be used. - `PARSE_PAGE_WITHOUT_LLM("parse_page_without_llm")` - `PARSE_PAGE_WITH_LLM("parse_page_with_llm")` - `PARSE_PAGE_WITH_LVM("parse_page_with_lvm")` - `PARSE_PAGE_WITH_AGENT("parse_page_with_agent")` - `PARSE_PAGE_WITH_LAYOUT_AGENT("parse_page_with_layout_agent")` - `PARSE_DOCUMENT_WITH_LLM("parse_document_with_llm")` - `PARSE_DOCUMENT_WITH_LVM("parse_document_with_lvm")` - `PARSE_DOCUMENT_WITH_AGENT("parse_document_with_agent")` - `Optional parsingInstruction` - `Optional pipelineId` The pipeline ID. - `Optional preciseBoundingBox` - `Optional premiumMode` - `Optional presentationOutOfBoundsContent` - `Optional presentationSkipEmbeddedData` - `Optional preserveLayoutAlignmentAcrossPages` - `Optional preserveVerySmallText` - `Optional preset` - `Optional priority` The priority for the request. This field may be ignored or overwritten depending on the organization tier. - `LOW("low")` - `MEDIUM("medium")` - `HIGH("high")` - `CRITICAL("critical")` - `Optional projectId` - `Optional removeHiddenText` - `Optional replaceFailedPageMode` Enum for representing the different available page error handling modes. - `RAW_TEXT("raw_text")` - `BLANK_PAGE("blank_page")` - `ERROR_MESSAGE("error_message")` - `Optional replaceFailedPageWithErrorMessagePrefix` - `Optional replaceFailedPageWithErrorMessageSuffix` - `Optional resourceInfo` The resource info about the file - `Optional saveImages` - `Optional skipDiagonalText` - `Optional specializedChartParsingAgentic` - `Optional specializedChartParsingEfficient` - `Optional specializedChartParsingPlus` - `Optional specializedImageParsing` - `Optional spreadsheetExtractSubTables` - `Optional spreadsheetForceFormulaComputation` - `Optional spreadsheetIncludeHiddenSheets` - `Optional strictModeBuggyFont` - `Optional strictModeImageExtraction` - `Optional strictModeImageOcr` - `Optional strictModeReconstruction` - `Optional structuredOutput` - `Optional structuredOutputJsonSchema` - `Optional structuredOutputJsonSchemaName` - `Optional systemPrompt` - `Optional systemPromptAppend` - `Optional takeScreenshot` - `Optional targetPages` - `Optional tier` - `Optional type` - `PARSE("parse")` - `Optional useVendorMultimodalModel` - `Optional userPrompt` - `Optional vendorMultimodalApiKey` - `Optional vendorMultimodalModelName` - `Optional version` - `Optional> webhookConfigurations` Outbound webhook endpoints to notify on job status changes - `Optional> webhookEvents` Events to subscribe to (e.g. 'parse.success', 'extract.error'). If null, all events are delivered. - `EXTRACT_PENDING("extract.pending")` - `EXTRACT_SUCCESS("extract.success")` - `EXTRACT_ERROR("extract.error")` - `EXTRACT_PARTIAL_SUCCESS("extract.partial_success")` - `EXTRACT_CANCELLED("extract.cancelled")` - `PARSE_PENDING("parse.pending")` - `PARSE_RUNNING("parse.running")` - `PARSE_SUCCESS("parse.success")` - `PARSE_ERROR("parse.error")` - `PARSE_PARTIAL_SUCCESS("parse.partial_success")` - `PARSE_CANCELLED("parse.cancelled")` - `CLASSIFY_PENDING("classify.pending")` - `CLASSIFY_RUNNING("classify.running")` - `CLASSIFY_SUCCESS("classify.success")` - `CLASSIFY_ERROR("classify.error")` - `CLASSIFY_PARTIAL_SUCCESS("classify.partial_success")` - `CLASSIFY_CANCELLED("classify.cancelled")` - `SHEETS_PENDING("sheets.pending")` - `SHEETS_SUCCESS("sheets.success")` - `SHEETS_ERROR("sheets.error")` - `SHEETS_PARTIAL_SUCCESS("sheets.partial_success")` - `SHEETS_CANCELLED("sheets.cancelled")` - `UNMAPPED_EVENT("unmapped_event")` - `Optional webhookHeaders` Custom HTTP headers sent with each webhook request (e.g. auth tokens) - `Optional webhookOutputFormat` Response format sent to the webhook: 'string' (default) or 'json' - `Optional webhookUrl` URL to receive webhook POST notifications - `Optional webhookUrl` - `Optional parentJobExecutionId` The ID of the parent job execution. - `Optional partitions` The partitions for this execution. Used for determining where to save job output. - `Optional projectId` The ID of the project this job belongs to. - `Optional sessionId` The upstream request ID that created this job. Used for tracking the job across services. - `Optional userId` The ID of the user that created this job - `Optional webhookUrl` The URL that needs to be called at the end of the parsing job. - `class ClassifyJob:` A classify job. - `String id` Unique identifier - `String projectId` The ID of the project - `List rules` The rules to classify the files - `String description` Natural language description of what to classify. Be specific about the content characteristics that identify this document type. - `String type` The document type to assign when this rule matches (e.g., 'invoice', 'receipt', 'contract') - `StatusEnum status` The status of the classify job - `PENDING("PENDING")` - `SUCCESS("SUCCESS")` - `ERROR("ERROR")` - `PARTIAL_SUCCESS("PARTIAL_SUCCESS")` - `CANCELLED("CANCELLED")` - `String userId` The ID of the user - `Optional createdAt` Creation datetime - `Optional effectiveAt` - `Optional errorMessage` Error message for the latest job attempt, if any. - `Optional jobRecordId` The job record ID associated with this status, if any. - `Optional mode` The classification mode to use - `FAST("FAST")` - `MULTIMODAL("MULTIMODAL")` - `Optional parsingConfiguration` The configuration for the parsing job - `Optional lang` The language to parse the files in - `AF("af")` - `AZ("az")` - `BS("bs")` - `CS("cs")` - `CY("cy")` - `DA("da")` - `DE("de")` - `EN("en")` - `ES("es")` - `ET("et")` - `FR("fr")` - `GA("ga")` - `HR("hr")` - `HU("hu")` - `ID("id")` - `IS("is")` - `IT("it")` - `KU("ku")` - `LA("la")` - `LT("lt")` - `LV("lv")` - `MI("mi")` - `MS("ms")` - `MT("mt")` - `NL("nl")` - `NO("no")` - `OC("oc")` - `PI("pi")` - `PL("pl")` - `PT("pt")` - `RO("ro")` - `RS_LATIN("rs_latin")` - `SK("sk")` - `SL("sl")` - `SQ("sq")` - `SV("sv")` - `SW("sw")` - `TL("tl")` - `TR("tr")` - `UZ("uz")` - `VI("vi")` - `AR("ar")` - `FA("fa")` - `UG("ug")` - `UR("ur")` - `BN("bn")` - `AS("as")` - `MNI("mni")` - `RU("ru")` - `RS_CYRILLIC("rs_cyrillic")` - `BE("be")` - `BG("bg")` - `UK("uk")` - `MN("mn")` - `ABQ("abq")` - `ADY("ady")` - `KBD("kbd")` - `AVA("ava")` - `DAR("dar")` - `INH("inh")` - `CHE("che")` - `LBE("lbe")` - `LEZ("lez")` - `TAB("tab")` - `TJK("tjk")` - `HI("hi")` - `MR("mr")` - `NE("ne")` - `BH("bh")` - `MAI("mai")` - `ANG("ang")` - `BHO("bho")` - `MAH("mah")` - `SCK("sck")` - `NEW("new")` - `GOM("gom")` - `SA("sa")` - `BGC("bgc")` - `TH("th")` - `CH_SIM("ch_sim")` - `CH_TRA("ch_tra")` - `JA("ja")` - `KO("ko")` - `TA("ta")` - `TE("te")` - `KN("kn")` - `Optional maxPages` The maximum number of pages to parse - `Optional> targetPages` The pages to target for parsing (0-indexed, so first page is at 0) - `Optional updatedAt` Update datetime - `JobType jobType` Type of processing performed - `PARSE("parse")` - `EXTRACT("extract")` - `CLASSIFY("classify")` - `String outputS3Path` Location of the processing output - `String parametersHash` Content hash of the job configuration for dedup - `LocalDateTime processedAt` When this processing occurred - `String resultId` Unique identifier for this result - `Optional outputMetadata` Metadata about processing output. Currently empty - will be populated with job-type-specific metadata fields in the future. ### Example ```java package com.llamacloud_prod.api.example; import com.llamacloud_prod.api.client.LlamaCloudClient; import com.llamacloud_prod.api.client.okhttp.LlamaCloudOkHttpClient; import com.llamacloud_prod.api.models.beta.batch.jobitems.JobItemGetProcessingResultsParams; import com.llamacloud_prod.api.models.beta.batch.jobitems.JobItemGetProcessingResultsResponse; public final class Main { private Main() {} public static void main(String[] args) { LlamaCloudClient client = LlamaCloudOkHttpClient.fromEnv(); JobItemGetProcessingResultsResponse response = client.beta().batch().jobItems().getProcessingResults("item_id"); } } ``` #### Response ```json { "item_id": "item_id", "item_name": "item_name", "processing_results": [ { "item_id": "item_id", "job_config": { "correlation_id": "182bd5e5-6e1a-4fe4-a799-aa6d9a6ab26e", "job_name": "parse_raw_file_job", "parameters": { "adaptive_long_table": true, "aggressive_table_extraction": true, "annotate_links": true, "auto_mode": true, "auto_mode_configuration_json": "auto_mode_configuration_json", "auto_mode_trigger_on_image_in_page": true, "auto_mode_trigger_on_regexp_in_page": "auto_mode_trigger_on_regexp_in_page", "auto_mode_trigger_on_table_in_page": true, "auto_mode_trigger_on_text_in_page": "auto_mode_trigger_on_text_in_page", "azure_openai_api_version": "azure_openai_api_version", "azure_openai_deployment_name": "azure_openai_deployment_name", "azure_openai_endpoint": "azure_openai_endpoint", "azure_openai_key": "azure_openai_key", "bbox_bottom": 0, "bbox_left": 0, "bbox_right": 0, "bbox_top": 0, "bounding_box": "bounding_box", "compact_markdown_table": true, "complemental_formatting_instruction": "complemental_formatting_instruction", "content_guideline_instruction": "content_guideline_instruction", "continuous_mode": true, "custom_metadata": { "foo": "bar" }, "disable_image_extraction": true, "disable_ocr": true, "disable_reconstruction": true, "do_not_cache": true, "do_not_unroll_columns": true, "enable_cost_optimizer": true, "extract_charts": true, "extract_layout": true, "extract_printed_page_number": true, "fast_mode": true, "formatting_instruction": "formatting_instruction", "gpt4o_api_key": "gpt4o_api_key", "gpt4o_mode": true, "guess_xlsx_sheet_name": true, "hide_footers": true, "hide_headers": true, "high_res_ocr": true, "html_make_all_elements_visible": true, "html_remove_fixed_elements": true, "html_remove_navigation_elements": true, "http_proxy": "http_proxy", "ignore_document_elements_for_layout_detection": true, "images_to_save": [ "screenshot" ], "inline_images_in_markdown": true, "input_s3_path": "input_s3_path", "input_s3_region": "input_s3_region", "input_url": "input_url", "internal_is_screenshot_job": true, "invalidate_cache": true, "is_formatting_instruction": true, "job_timeout_extra_time_per_page_in_seconds": 0, "job_timeout_in_seconds": 0, "keep_page_separator_when_merging_tables": true, "lang": "lang", "languages": [ "af" ], "layout_aware": true, "line_level_bounding_box": true, "markdown_table_multiline_header_separator": "markdown_table_multiline_header_separator", "max_pages": 0, "max_pages_enforced": 0, "merge_tables_across_pages_in_markdown": true, "model": "model", "outlined_table_extraction": true, "output_pdf_of_document": true, "output_s3_path_prefix": "output_s3_path_prefix", "output_s3_region": "output_s3_region", "output_tables_as_HTML": true, "outputBucket": "outputBucket", "page_error_tolerance": 0, "page_footer_prefix": "page_footer_prefix", "page_footer_suffix": "page_footer_suffix", "page_header_prefix": "page_header_prefix", "page_header_suffix": "page_header_suffix", "page_prefix": "page_prefix", "page_separator": "page_separator", "page_suffix": "page_suffix", "parse_mode": "parse_page_without_llm", "parsing_instruction": "parsing_instruction", "pipeline_id": "pipeline_id", "precise_bounding_box": true, "premium_mode": true, "presentation_out_of_bounds_content": true, "presentation_skip_embedded_data": true, "preserve_layout_alignment_across_pages": true, "preserve_very_small_text": true, "preset": "preset", "priority": "low", "project_id": "project_id", "remove_hidden_text": true, "replace_failed_page_mode": "raw_text", "replace_failed_page_with_error_message_prefix": "replace_failed_page_with_error_message_prefix", "replace_failed_page_with_error_message_suffix": "replace_failed_page_with_error_message_suffix", "resource_info": { "foo": "bar" }, "save_images": true, "skip_diagonal_text": true, "specialized_chart_parsing_agentic": true, "specialized_chart_parsing_efficient": true, "specialized_chart_parsing_plus": true, "specialized_image_parsing": true, "spreadsheet_extract_sub_tables": true, "spreadsheet_force_formula_computation": true, "spreadsheet_include_hidden_sheets": true, "strict_mode_buggy_font": true, "strict_mode_image_extraction": true, "strict_mode_image_ocr": true, "strict_mode_reconstruction": true, "structured_output": true, "structured_output_json_schema": "structured_output_json_schema", "structured_output_json_schema_name": "structured_output_json_schema_name", "system_prompt": "system_prompt", "system_prompt_append": "system_prompt_append", "take_screenshot": true, "target_pages": "target_pages", "tier": "tier", "type": "parse", "use_vendor_multimodal_model": true, "user_prompt": "user_prompt", "vendor_multimodal_api_key": "vendor_multimodal_api_key", "vendor_multimodal_model_name": "vendor_multimodal_model_name", "version": "version", "webhook_configurations": [ { "webhook_events": [ "parse.success", "parse.error" ], "webhook_headers": { "Authorization": "Bearer sk-..." }, "webhook_output_format": "json", "webhook_url": "https://example.com/webhooks/llamacloud" } ], "webhook_url": "webhook_url" }, "parent_job_execution_id": "182bd5e5-6e1a-4fe4-a799-aa6d9a6ab26e", "partitions": { "foo": "182bd5e5-6e1a-4fe4-a799-aa6d9a6ab26e" }, "project_id": "182bd5e5-6e1a-4fe4-a799-aa6d9a6ab26e", "session_id": "182bd5e5-6e1a-4fe4-a799-aa6d9a6ab26e", "user_id": "user_id", "webhook_url": "webhook_url" }, "job_type": "parse", "output_s3_path": "output_s3_path", "parameters_hash": "parameters_hash", "processed_at": "2019-12-27T18:11:19.117Z", "result_id": "result_id", "output_metadata": {} } ] } ```