Update Task
Updates an existing task’s configuration and reprocesses the document.
Requirements:
- Task must have status Succeeded or Failed
- New configuration must be different from the current one
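The two requirements above can be checked on the client before sending the PATCH request. The following is a minimal sketch, not part of the Chunkr API: `can_update` is a hypothetical helper, and it assumes the task is available as a dictionary shaped like the example response (with `status` and `configuration` keys). Treating null fields in the requested update as "leave unchanged" is also an assumption.

```python
UPDATABLE_STATUSES = {"Succeeded", "Failed"}


def can_update(task: dict, new_config: dict) -> bool:
    """Return True if the task looks eligible for an update (hypothetical helper)."""
    # Requirement 1: the task must have finished, successfully or not.
    if task.get("status") not in UPDATABLE_STATUSES:
        return False
    current = task.get("configuration", {})
    # Assumption: None (JSON null) fields in the request are ignored.
    requested = {k: v for k, v in new_config.items() if v is not None}
    # Requirement 2: at least one requested value must differ from the current one.
    return any(current.get(k) != v for k, v in requested.items())
```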
The returned task will typically be in a Starting or Processing state.
Use the GET /task/{task_id} endpoint to poll for completion.
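Polling can be sketched as a loop that stops once the task reaches a terminal status. This is a minimal sketch under stated assumptions: `fetch_task` is a hypothetical callable that performs GET /task/{task_id} and returns the parsed JSON, the interval and timeout values are arbitrary, and treating Succeeded, Failed, and Cancelled as terminal is inferred from the status values listed in the response schema.

```python
import time
from typing import Callable

# Assumption: these statuses mean no further processing will happen.
TERMINAL_STATUSES = {"Succeeded", "Failed", "Cancelled"}


def wait_for_task(fetch_task: Callable[[], dict],
                  interval_s: float = 2.0,
                  timeout_s: float = 600.0,
                  sleep: Callable[[float], None] = time.sleep) -> dict:
    """Call fetch_task repeatedly until the task reaches a terminal status."""
    deadline = time.monotonic() + timeout_s
    while True:
        task = fetch_task()
        if task.get("status") in TERMINAL_STATUSES:
            return task
        if time.monotonic() >= deadline:
            raise TimeoutError("task did not finish before the timeout")
        sleep(interval_s)
```

Passing the HTTP call in as `fetch_task` keeps the loop independent of any particular HTTP library.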
curl --request PATCH \
  --url https://api.chunkr.ai/api/v1/task/{task_id}/parse \
  --header 'Authorization: <api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "chunk_processing": null,
    "expires_in": 123,
    "high_resolution": true,
    "ocr_strategy": null,
    "pipeline": null,
    "segment_processing": null,
    "segmentation_strategy": null
  }'
{
  "configuration": {
    "chunk_processing": {
      "ignore_headers_and_footers": true,
      "target_length": 512
    },
    "expires_in": 123,
    "high_resolution": true,
    "input_file_url": "<string>",
    "json_schema": "<any>",
    "model": null,
    "ocr_strategy": "All",
    "pipeline": null,
    "segment_processing": {
      "Caption": null,
      "Footnote": null,
      "Formula": null,
      "ListItem": null,
      "Page": null,
      "PageFooter": null,
      "PageHeader": null,
      "Picture": null,
      "SectionHeader": null,
      "Table": null,
      "Text": null,
      "Title": null
    },
    "segmentation_strategy": "LayoutAnalysis",
    "target_chunk_length": 123
  },
  "created_at": "2023-11-07T05:31:56Z",
  "expires_at": "2023-11-07T05:31:56Z",
  "finished_at": "2023-11-07T05:31:56Z",
  "message": "<string>",
  "output": null,
  "started_at": "2023-11-07T05:31:56Z",
  "status": "Starting",
  "task_id": "<string>",
  "task_url": "<string>"
}
Authorizations
Path Parameters
Body
chunk_processing: Controls the settings for chunking and for post-processing each chunk.
- ignore_headers_and_footers: Whether to ignore headers and footers when chunking. This is recommended because headers and footers break reading order across pages.
- target_length: The target number of words in each chunk. If 0, each chunk will contain a single segment.
expires_in: The number of seconds until the task is deleted. Expired tasks cannot be updated, polled, or accessed via the web interface.
high_resolution: Whether to use high-resolution images for cropping and post-processing. (Latency penalty: ~7 seconds per page)
ocr_strategy: Controls the Optical Character Recognition (OCR) strategy.
- All: Processes all pages with OCR. (Latency penalty: ~0.5 seconds per page)
- Auto: Selectively applies OCR only to pages with missing or low-quality text. When a text layer is present, the bounding boxes from the text layer are used.
Allowed values: All, Auto
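As a rough worked example of the latency figures quoted above (~7 seconds per page for high_resolution and ~0.5 seconds per page for the All OCR strategy), the sketch below estimates the added processing time for a document. `estimate_added_latency` is a hypothetical helper, and it assumes the two penalties simply add up per page, which the source does not state.

```python
HIGH_RES_PENALTY_S = 7.0  # ~7 seconds per page, from the high_resolution description
OCR_ALL_PENALTY_S = 0.5   # ~0.5 seconds per page, from the OCR "All" description


def estimate_added_latency(pages: int, high_resolution: bool, ocr_strategy: str) -> float:
    """Estimate extra seconds of processing; assumes the penalties are additive."""
    per_page = 0.0
    if high_resolution:
        per_page += HIGH_RES_PENALTY_S
    if ocr_strategy == "All":
        per_page += OCR_ALL_PENALTY_S
    return pages * per_page
```

For a 10-page document with both options enabled, this gives 10 × (7 + 0.5) = 75 seconds of estimated extra latency.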
pipeline: The pipeline to use for processing. If pipeline is set to Azure, Azure layout analysis is used for segmentation and OCR, and the output is unified to the Chunkr output format.
Allowed values: Azure
segment_processing: Controls the post-processing of each segment type.
Allows you to generate HTML and Markdown from Chunkr models for each segment type.
By default, the HTML and Markdown are generated manually using the segmentation information, except for Table and Formula.
You can optionally configure custom LLM prompts and models to generate an additional llm field with LLM-processed content for each segment type.
Controls the processing and generation for the segment.
- crop_image controls whether to crop the file's images to the segment's bounding box. The cropped image will be stored in the segment's image field. Use All to always crop, or Auto to only crop when needed for post-processing.
- html is the HTML output for the segment, generated either through heuristics (Auto) or using Chunkr fine-tuned models (LLM).
- llm is the LLM-generated output for the segment; it uses off-the-shelf models to generate a custom output for the segment.
- markdown is the Markdown output for the segment, generated either through heuristics (Auto) or using Chunkr fine-tuned models (LLM).
crop_image: Controls the cropping strategy for an item (e.g. segment, chunk, etc.)
- All crops all images in the item
- Auto crops images only if required for post-processing
Allowed values: All, Auto
html: Allowed values: LLM, Auto
llm: Prompt for the LLM model
markdown: Allowed values: LLM, Auto
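A request body that customizes a single segment type might be assembled as follows. This is a sketch, not an official client: `segment_config` is a hypothetical helper, the field names (`crop_image`, `html`, `llm`, `markdown`) and their allowed values come from the descriptions above, and the prompt text is made up for illustration.

```python
import json
from typing import Optional

CROP_VALUES = {"All", "Auto"}
GENERATION_VALUES = {"LLM", "Auto"}


def segment_config(crop_image: str = "Auto", html: str = "Auto",
                   markdown: str = "Auto", llm: Optional[str] = None) -> dict:
    """Build the configuration for one segment type, validating enum values."""
    if crop_image not in CROP_VALUES:
        raise ValueError(f"crop_image must be one of {sorted(CROP_VALUES)}")
    for name, value in (("html", html), ("markdown", markdown)):
        if value not in GENERATION_VALUES:
            raise ValueError(f"{name} must be one of {sorted(GENERATION_VALUES)}")
    # llm is an optional prompt string; None serializes to JSON null.
    return {"crop_image": crop_image, "html": html, "llm": llm, "markdown": markdown}


body = {
    "segment_processing": {
        # Illustrative prompt only; any text could be used here.
        "Table": segment_config(html="LLM", markdown="LLM",
                                llm="Summarize this table in one sentence."),
    }
}
print(json.dumps(body, indent=2))
```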
segmentation_strategy: Controls the segmentation strategy:
- LayoutAnalysis: Analyzes pages for layout elements (e.g., Table, Picture, Formula, etc.) using bounding boxes. Provides fine-grained segmentation and better chunking. (Latency penalty: ~TBD seconds per page)
- Page: Treats each page as a single segment. Faster processing, but without layout element detection and only simple chunking.
Allowed values: LayoutAnalysis, Page
Response
configuration: The configuration used for the task.
chunk_processing: Controls the settings for chunking and for post-processing each chunk.
- ignore_headers_and_footers: Whether to ignore headers and footers when chunking. This is recommended because headers and footers break reading order across pages.
- target_length: The target number of words in each chunk. If 0, each chunk will contain a single segment.
high_resolution: Whether to use high-resolution images for cropping and post-processing.
ocr_strategy: Controls the Optical Character Recognition (OCR) strategy.
- All: Processes all pages with OCR. (Latency penalty: ~0.5 seconds per page)
- Auto: Selectively applies OCR only to pages with missing or low-quality text. When a text layer is present, the bounding boxes from the text layer are used.
Allowed values: All, Auto
segment_processing: Controls the post-processing of each segment type.
Allows you to generate HTML and Markdown from Chunkr models for each segment type.
By default, the HTML and Markdown are generated manually using the segmentation information, except for Table and Formula.
You can optionally configure custom LLM prompts and models to generate an additional llm field with LLM-processed content for each segment type.
Controls the processing and generation for the segment.
- crop_image controls whether to crop the file's images to the segment's bounding box. The cropped image will be stored in the segment's image field. Use All to always crop, or Auto to only crop when needed for post-processing.
- html is the HTML output for the segment, generated either through heuristics (Auto) or using Chunkr fine-tuned models (LLM).
- llm is the LLM-generated output for the segment; it uses off-the-shelf models to generate a custom output for the segment.
- markdown is the Markdown output for the segment, generated either through heuristics (Auto) or using Chunkr fine-tuned models (LLM).
crop_image: Controls the cropping strategy for an item (e.g. segment, chunk, etc.)
- All crops all images in the item
- Auto crops images only if required for post-processing
Allowed values: All, Auto
html: Allowed values: LLM, Auto
llm: Prompt for the LLM model
markdown: Allowed values: LLM, Auto
segmentation_strategy: Controls the segmentation strategy:
- LayoutAnalysis: Analyzes pages for layout elements (e.g., Table, Picture, Formula, etc.) using bounding boxes. Provides fine-grained segmentation and better chunking. (Latency penalty: ~TBD seconds per page)
- Page: Treats each page as a single segment. Faster processing, but without layout element detection and only simple chunking.
Allowed values: LayoutAnalysis, Page
expires_in: The number of seconds until the task is deleted. Expired tasks cannot be updated, polled, or accessed via the web interface.
input_file_url: The presigned URL of the input file.
model: Allowed values: Fast, HighQuality
pipeline: Allowed values: Azure
target_chunk_length: The target number of words in each chunk. If 0, each chunk will contain a single segment.
created_at: The date and time when the task was created and queued.
message: A message describing the task's status or any errors that occurred.
status: The status of the task.
Allowed values: Starting, Processing, Succeeded, Failed, Cancelled
task_id: The unique identifier for the task.
expires_at: The date and time when the task will expire.
finished_at: The date and time when the task was finished.
output: The processed results of a document analysis task.
Collection of document chunks, where each chunk contains one or more segments
The total number of words in the chunk (x > 0).
Collection of document segments that form this chunk. When target_chunk_length > 0, contains the maximum number of segments that fit within that length (segments remain intact). Otherwise, contains exactly one segment.
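The packing rule described above can be illustrated with a greedy sketch. This is not Chunkr's actual chunking algorithm: `pack_segments` is a hypothetical function that operates on per-segment word counts, and letting a segment longer than the target form its own chunk is an assumption.

```python
from typing import List


def pack_segments(word_counts: List[int], target_chunk_length: int) -> List[List[int]]:
    """Group segment indices into chunks without ever splitting a segment."""
    # A target of 0 means one segment per chunk.
    if target_chunk_length <= 0:
        return [[i] for i in range(len(word_counts))]
    chunks: List[List[int]] = []
    current: List[int] = []
    current_len = 0
    for i, n in enumerate(word_counts):
        # Close the current chunk if adding this segment would exceed the target.
        if current and current_len + n > target_chunk_length:
            chunks.append(current)
            current, current_len = [], 0
        current.append(i)
        current_len += n
    if current:
        chunks.append(current)
    return chunks
```

With word counts of 100, 200, 300, and 50 and a target of 512, the first two segments share a chunk (300 words) and the last two share another (350 words).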
Bounding box for an item. It is used for chunks, segments and OCR results.
The height of the bounding box.
The left coordinate of the bounding box.
The top coordinate of the bounding box.
The width of the bounding box.
Height of the page containing the segment.
Page number of the segment (x > 0).
Width of the page containing the segment.
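Because a bounding box is described by its left and top coordinates plus a width and height, and each segment also records the dimensions of its page, a box can be converted to corner coordinates or to page-relative fractions. The sketch below assumes the attribute names `left`, `top`, `width`, and `height` and the parameter names `page_width` and `page_height`; they mirror the descriptions above, but the exact field names are not shown in this section.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class BoundingBox:
    left: float
    top: float
    width: float
    height: float

    @property
    def right(self) -> float:
        return self.left + self.width

    @property
    def bottom(self) -> float:
        return self.top + self.height

    def normalized(self, page_width: float, page_height: float) -> Tuple[float, float, float, float]:
        """Return (left, top, right, bottom) as fractions of the page size."""
        return (self.left / page_width, self.top / page_height,
                self.right / page_width, self.bottom / page_height)
```

Normalized coordinates are convenient when overlaying boxes on a page image rendered at a different resolution.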
Unique identifier for the segment.
All the possible types for a segment. Note: Different configurations will produce different types. Please refer to the documentation for more information.
Allowed values: Caption, Footnote, Formula, ListItem, Page, PageFooter, PageHeader, Picture, SectionHeader, Table, Text, Title
Confidence score of the layout analysis model
Text content of the segment.
HTML representation of the segment.
Presigned URL to the image of the segment.
LLM representation of the segment.
Markdown representation of the segment.
OCR results for the segment.
OCR results for a segment
The unique identifier for the chunk.
Suggested text to embed for search.
The extracted JSON from the document.
The name of the file.
The number of pages in the file (x > 0).
The presigned URL of the PDF file.
started_at: The date and time when the task was started.
task_url: The presigned URL of the task.