Parsing

Parse File
POST /api/v2/parse
Get Parse Job
GET /api/v2/parse/{job_id}
List Parse Jobs
GET /api/v2/parse
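
The three endpoints above share the `/api/v2/parse` path, with the job ID appended for single-job lookups. A minimal sketch of building those URLs follows. The base URL and the helper names are hypothetical, and percent-encoding the job ID is a defensive assumption rather than a documented requirement.

```python
from urllib.parse import quote

# Path taken from the endpoint list above.
PARSE_PATH = "/api/v2/parse"


def parse_collection_url(base_url: str) -> str:
    """URL for POST (parse a file) and GET (list parse jobs)."""
    return f"{base_url.rstrip('/')}{PARSE_PATH}"


def parse_job_url(base_url: str, job_id: str) -> str:
    """URL for GET /api/v2/parse/{job_id}."""
    # Assumption: encode the ID so unexpected characters cannot alter the path.
    return f"{parse_collection_url(base_url)}/{quote(job_id, safe='')}"


# Hypothetical base URL, used only for illustration.
print(parse_job_url("https://api.example.com/", "job-123"))
```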
Models
BBox = object { h, w, x, 5 more }

Bounding box with coordinates and optional metadata.

h: number

Height of the bounding box

w: number

Width of the bounding box

x: number

X coordinate of the bounding box

y: number

Y coordinate of the bounding box

confidence: optional number

Confidence score

end_index: optional number

End index in the text

label: optional string

Label for the bounding box

start_index: optional number

Start index in the text
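
As an illustration of the required and optional fields listed above, here is one way a client might model `BBox` in Python. This is a sketch, not part of any official SDK. The class layout and `from_dict` helper are assumptions, and the index fields are typed as `int` here although the schema only says `number`.

```python
from dataclasses import dataclass
from typing import Any, Mapping, Optional


@dataclass(frozen=True)
class BBox:
    # Required fields
    x: float
    y: float
    w: float
    h: float
    # Optional metadata; None when the response omits the field
    confidence: Optional[float] = None
    label: Optional[str] = None
    start_index: Optional[int] = None  # documented as "number"; int assumed
    end_index: Optional[int] = None    # documented as "number"; int assumed

    @classmethod
    def from_dict(cls, data: Mapping[str, Any]) -> "BBox":
        """Build a BBox from a decoded JSON object.

        Raises KeyError if a required field (x, y, w, h) is missing.
        """
        return cls(
            x=data["x"],
            y=data["y"],
            w=data["w"],
            h=data["h"],
            confidence=data.get("confidence"),
            label=data.get("label"),
            start_index=data.get("start_index"),
            end_index=data.get("end_index"),
        )


box = BBox.from_dict({"x": 10, "y": 20, "w": 100, "h": 15, "label": "text"})
```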

FailPageMode = "raw_text" or "blank_page" or "error_message"

Enum of the available page error handling modes.

Accepts one of the following:
"raw_text"
"blank_page"
"error_message"
ListItem = object { items, md, ordered, 2 more }
items: array of object { md, value, bbox, type } or ListItem { items, md, ordered, 2 more }

List of nested text or list items

Accepts one of the following:
TextItem = object { md, value, bbox, type }
md: string

Markdown representation preserving formatting

value: string

Text content

bbox: optional array of BBox { h, w, x, 5 more }

List of bounding boxes

type: optional "text"

Text item type

ListItem = object { items, md, ordered, 2 more }

Nested list with the same fields as the enclosing ListItem (the type is recursive).

md: string

Markdown representation preserving formatting

ordered: boolean

Whether the list is ordered or unordered

bbox: optional array of BBox { h, w, x, 5 more }

List of bounding boxes

type: optional "list"

List item type
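
Because `ListItem.items` can hold either text items or further `ListItem` objects, consumers typically walk the structure recursively. The sketch below flattens a decoded `ListItem` into indented text lines. Since `type` is optional, it distinguishes the two shapes by the presence of an `items` key, which is an assumption about the payload and not a documented rule. The function name and output format are illustrative only.

```python
from typing import Any, Dict, List


def flatten_list_item(item: Dict[str, Any], depth: int = 0) -> List[str]:
    """Return one indented line per text item in a (possibly nested) ListItem."""
    lines: List[str] = []
    number = 0  # counts text items only, so nested lists do not skip numbers
    for child in item["items"]:
        if "items" in child:
            # Nested ListItem: recurse one indentation level deeper.
            lines.extend(flatten_list_item(child, depth + 1))
        else:
            # TextItem: emit its plain-text value with a list marker.
            number += 1
            marker = f"{number}." if item["ordered"] else "-"
            lines.append(f"{'  ' * depth}{marker} {child['value']}")
    return lines


example = {
    "type": "list",
    "ordered": True,
    "md": "1. First\n   - nested\n2. Second",
    "items": [
        {"type": "text", "md": "First", "value": "First"},
        {
            "type": "list",
            "ordered": False,
            "md": "- nested",
            "items": [{"md": "nested", "value": "nested"}],
        },
        {"md": "Second", "value": "Second"},
    ],
}

for line in flatten_list_item(example):
    print(line)
```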

LlamaParseSupportedFileExtensions = ".pdf" or ".abw" or ".awt" or 141 more

Enum for supported file extensions.

Accepts one of the following:
".pdf"
".abw"
".awt"
".cgm"
".cwk"
".doc"
".docm"
".docx"
".dot"
".dotm"
".dotx"
".fodg"
".fodp"
".fopd"
".fodt"
".fb2"
".hwp"
".lwp"
".mcw"
".mw"
".mwd"
".odf"
".odt"
".otg"
".ott"
".pages"
".pbd"
".psw"
".rtf"
".sda"
".sdd"
".sdp"
".sdw"
".sgl"
".std"
".stw"
".sxd"
".sxg"
".sxm"
".sxw"
".uof"
".uop"
".uot"
".vor"
".wpd"
".wps"
".wpt"
".wri"
".wn"
".xml"
".zabw"
".key"
".odp"
".odg"
".otp"
".pot"
".potm"
".potx"
".ppt"
".pptm"
".pptx"
".sti"
".sxi"
".vsd"
".vsdm"
".vsdx"
".vdx"
".bmp"
".gif"
".jpg"
".jpeg"
".png"
".svg"
".tif"
".tiff"
".webp"
".htm"
".html"
".xhtm"
".csv"
".dbf"
".dif"
".et"
".eth"
".fods"
".numbers"
".ods"
".ots"
".prn"
".qpw"
".slk"
".stc"
".sxc"
".sylk"
".tsv"
".uos1"
".uos2"
".uos"
".wb1"
".wb2"
".wb3"
".wk1"
".wk2"
".wk3"
".wk4"
".wks"
".wq1"
".wq2"
".xlr"
".xls"
".xlsb"
".xlsm"
".xlsx"
".xlw"
".azw"
".azw3"
".azw4"
".cb7"
".cbc"
".cbr"
".cbz"
".chm"
".djvu"
".epub"
".fbz"
".htmlz"
".lit"
".lrf"
".md"
".mobi"
".pdb"
".pml"
".prc"
".rb"
".snb"
".tcr"
".txtz"
".m4a"
".mp3"
".mp4"
".mpeg"
".mpga"
".wav"
".webm"
ParsingJob = object { id, status, error_code, error_message }

Response schema for a parsing job.

id: string
status: StatusEnum

Enum for representing the status of a job

Accepts one of the following:
"PENDING"
"SUCCESS"
"ERROR"
"PARTIAL_SUCCESS"
"CANCELLED"
error_code: optional string
error_message: optional string
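
A common way to consume `ParsingJob` is to poll the Get Parse Job endpoint until the status leaves `"PENDING"`. The sketch below shows that loop under two assumptions: `"PENDING"` is treated as the only non-terminal status, and `fetch_job` is a hypothetical caller-supplied function that performs the GET request and returns the decoded JSON object. The interval and timeout defaults are arbitrary.

```python
import time
from typing import Any, Callable, Dict


def wait_for_job(
    fetch_job: Callable[[str], Dict[str, Any]],
    job_id: str,
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll until the job status is no longer "PENDING", then return the job."""
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_job(job_id)
        # Assumption: every status other than PENDING is final.
        if job["status"] != "PENDING":
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} still PENDING after {timeout_s}s")
        sleep(interval_s)


def describe(job: Dict[str, Any]) -> str:
    """Summarize a finished job, including the optional error fields."""
    if job["status"] == "ERROR":
        code = job.get("error_code") or "unknown"
        message = job.get("error_message") or "no message"
        return f"{job['id']}: ERROR ({code}): {message}"
    return f"{job['id']}: {job['status']}"
```

Passing `sleep` as a parameter keeps the loop easy to exercise in tests without real delays.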
ParsingLanguages = "af" or "az" or "bs" or 83 more

Enum for representing the languages supported by the parser.

Accepts one of the following:
"af"
"az"
"bs"
"cs"
"cy"
"da"
"de"
"en"
"es"
"et"
"fr"
"ga"
"hr"
"hu"
"id"
"is"
"it"
"ku"
"la"
"lt"
"lv"
"mi"
"ms"
"mt"
"nl"
"no"
"oc"
"pi"
"pl"
"pt"
"ro"
"rs_latin"
"sk"
"sl"
"sq"
"sv"
"sw"
"tl"
"tr"
"uz"
"vi"
"ar"
"fa"
"ug"
"ur"
"bn"
"as"
"mni"
"ru"
"rs_cyrillic"
"be"
"bg"
"uk"
"mn"
"abq"
"ady"
"kbd"
"ava"
"dar"
"inh"
"che"
"lbe"
"lez"
"tab"
"tjk"
"hi"
"mr"
"ne"
"bh"
"mai"
"ang"
"bho"
"mah"
"sck"
"new"
"gom"
"sa"
"bgc"
"th"
"ch_sim"
"ch_tra"
"ja"
"ko"
"ta"
"te"
"kn"
ParsingMode = "parse_page_without_llm" or "parse_page_with_llm" or "parse_page_with_lvm" or 5 more

Enum for representing the mode of parsing to be used.

Accepts one of the following:
"parse_page_without_llm"
"parse_page_with_llm"
"parse_page_with_lvm"
"parse_page_with_agent"
"parse_page_with_layout_agent"
"parse_document_with_llm"
"parse_document_with_lvm"
"parse_document_with_agent"
StatusEnum = "PENDING" or "SUCCESS" or "ERROR" or 2 more

Enum for representing the status of a job

Accepts one of the following:
"PENDING"
"SUCCESS"
"ERROR"
"PARTIAL_SUCCESS"
"CANCELLED"