Extract text from images
ocr_strategy
parameter.
We have two strategies:
All
(Default): Processes all pages with our OCR model.Auto
: Intelligently applies OCR only to pages with missing or low-quality text. When a text layer is present, the bounding boxes from that layer are used instead of running OCR.Auto
strategy provides the best balance between accuracy and performance for most use cases.
Use the All
strategy when you need to ensure consistent text extraction across all pages or when you suspect the existing text layer might be unreliable.