In addition to using chunkr’s default models, we also provide a pipeline interface to allow you to use Azure Document Intelligence as a provider. When using Azure, instead of the default models, your files are processed through the Azure layout analysis model, the Azure OCR model, and the Azure table OCR model. You can still leverage Chunkr’s intelligent chunking and segment processing. The output will be mapped to the Chunkr output format.Documentation Index
Fetch the complete documentation index at: https://docs.chunkr.ai/llms.txt
Use this file to discover all available pages before exploring further.
When to use Azure
- If our queue is full, you can use Azure to process your files
- If you don’t need VLMs on your tables, you can use the Azure table OCR model. This will allow you to get much faster results.
- Better OCR (we are working on it!)
Example
- Use default segment processing and chunking with the Chunkr layout analysis model and OCR model.
- Use default chunking with the Azure layout analysis model, OCR model and table OCR model.
In this case, the content for the
Tablesegment will be generated by the Azure table OCR model.