> ## Documentation Index
> Fetch the complete documentation index at: https://docs.chunkr.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Welcome to Chunkr

> Complex, messy documents to high-quality data

<Frame>
  <img src="https://mintcdn.com/lumina-53fcea44/yRc36sZAzbhE6z9a/assets/chunkr_banner.png?fit=max&auto=format&n=yRc36sZAzbhE6z9a&q=85&s=dc2101baaaa3c3ddc87502fb69431222" alt="Chunkr documentation banner image" width="4260" height="2400" data-path="assets/chunkr_banner.png" />
</Frame>

Chunkr turns complex documents like PDFs, spreadsheets, and images into clean data - fast, accurate, and at scale.
We build industry leading VLMs + computer-vision models to deliver structured, machine-readable outputs with unmatched accuracy.

This guide contains everything you need to understand Chunkr. If anything is missing,
we're here to help at [support@chunkr.ai](mailto:support@chunkr.ai).

## Get Started

<CardGroup>
  <Card title="Quickstart" href="/pages/get-started/quickstart" icon="bolt">
    Get started in under 2 minutes
  </Card>

  <Card title="API Reference/SDKs" href="/api-references/tasks/create-extract-task" icon="rocket">
    Powerful Python and Typescript libraries
  </Card>

  <Card title="Web Interface" href="/pages/get-started/web-interface" icon="browser">
    Test documents and view results instantly
  </Card>
</CardGroup>

***

## Features

<CardGroup>
  <Card title="Task System" href="/pages/task-system/overview" icon="upload">
    A simple, task-based API that gives you full control over your document
    ingestion.
  </Card>

  <Card title="Parse" href="/pages/features/parse/overview" icon="crop-simple">
    Turn any document into LLM-ready data. Markdown, bounding boxes, etc.
  </Card>

  <Card title="Extract" href="/pages/features/extract/overview" icon="brackets-curly">
    Auto-fill custom schemas. Citations, confidence scores, structured JSON.
  </Card>
</CardGroup>

***

## What can I do with Chunkr?

<b>
  Chunkr is built for any AI and developer team that works with messy documents
  at scale.
</b>

You can process a [vast array of document](/pages/get-started/file-types) types across any industry. Use cases are wide and varied, here are some of the things folks build with our outputs:

### Standout AI Applications

* **Intelligent RAG systems**: Feed your Retrieval-Augmented Generation pipelines with perfectly chunked, application-ready content from any document.
* **Power document-first applications**: Leverage bounding boxes, citations, and precise OCR to build visual search tools, verification interfaces, and interactive document experiences.
* **Create specialized AI agents**: Develop sophisticated agents that can reason over and extract insights from legal contracts, financial reports, spreadsheets, or scientific papers.

### Automate Critical Workflows

* **Finance**: Automate data entry and accelerate financial analysis by processing high volumes of invoices, bank statements, and 10-K/10-Q reports.
* **Legal**: Streamline compliance and legal review by automating data extraction from regulatory filings, contracts, and evidence documents with fully auditable, citation-backed results.
* **Supply Chain**: Digitize and process bills of lading, packing slips, and purchase orders to enhance logistics, reduce manual errors, and speed up your supply chain.

***

## Security and Trust

Security is at the core of our platform. We offer a SOC 2 and HIPAA-compliant service, never train on your data, and provide on-premise solutions for maximum control. We also maintain backwards compatibility to ensure a stable, reliable platform you can depend on.

<CardGroup>
  <Card title="Policies" href="/pages/security/policies" icon="shield-check">
    Explore our comprehensive security policies and commitment to data privacy.
  </Card>

  <Card title="On Premise" href="/pages/security/on-premise" icon="server">
    Learn about our on-premise offerings for maximum data control and security.
  </Card>
</CardGroup>
