Introduction to Docling

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

What is Docling?

Docling is a powerful SDK and CLI tool that converts documents from various formats (PDF, DOCX, PPTX, HTML, images, audio, and more) into a unified, structured representation perfect for downstream AI workflows. With state-of-the-art PDF understanding capabilities, OCR support, and native integrations with popular AI frameworks, Docling accelerates your document AI applications.

Installation

Get started with pip, uv, or your preferred package manager

Quick start

Convert your first document in minutes with simple Python code

Core concepts

Learn about Docling’s architecture and key components

Usage guides

Explore detailed guides for common workflows

Key features

Multi-format support

Docling handles multiple document formats out of the box:

PDF with advanced layout understanding, reading order, table structure, formulas, and more
Office documents: DOCX, PPTX, XLSX
Web formats: HTML
Images: PNG, TIFF, JPEG with OCR support
Audio: WAV, MP3 with Automatic Speech Recognition (ASR)
Specialized formats: LaTeX, WebVTT, XBRL financial reports, JATS articles, USPTO patents

Advanced PDF understanding

Docling excels at PDF processing with:

Page layout analysis - Detect headers, paragraphs, lists, tables, figures, and formulas
Reading order detection - Preserve logical document flow
Table structure recognition - Extract tables with accurate cell relationships
Code block detection - Identify and preserve code snippets
Formula understanding - Recognize mathematical equations
Image classification - Categorize figures and diagrams

Unified document representation

All documents are converted to the expressive DoclingDocument format, providing:

Structured content with hierarchical organization
Metadata preservation including fonts, styles, and layout information
Relationship tracking between document elements
Export flexibility to Markdown, HTML, JSON, DocTags, and more

AI framework integrations

Plug-and-play integrations with popular frameworks:

LangChain

Build RAG applications with LangChain document loaders

LlamaIndex

Index and query documents with LlamaIndex readers

Haystack

Create search pipelines with Haystack converters

Crew AI

Power AI agents with structured document data

Privacy and security

Docling supports local execution for sensitive data and air-gapped environments - all processing happens on your infrastructure with no external API calls required.

What’s new

Docling is actively developed with frequent improvements and new features.

Structured information extraction - Extract specific data using schemas [beta]
Heron layout model - Faster PDF parsing by default
MCP server - Connect to agentic applications via Model Context Protocol
XBRL parsing - Process financial reports in eXtensible Business Reporting Language
WebVTT support - Parse and export Web Video Text Tracks
LaTeX parsing - Convert LaTeX documents to structured format

Use cases

Docling powers a wide range of document AI applications:

RAG (Retrieval-Augmented Generation) - Convert documents for vector databases and semantic search
Document Q&A - Extract structured content for question answering systems
Data extraction - Pull tables, figures, and structured data from reports
Archive digitization - OCR and structure legacy documents
Compliance & analysis - Process financial reports, patents, and regulatory documents
Content migration - Convert documents between formats while preserving structure

Technical foundation

Docling leverages cutting-edge AI models including:

Layout models (Heron) for page segmentation
Table structure models for accurate table extraction
Visual Language Models (GraniteDocling) for enhanced understanding
OCR engines (EasyOCR, Tesseract, RapidOCR, macOS Vision)
ASR models (Whisper with MLX acceleration on Apple Silicon)

For technical details, see the Docling Technical Report (arXiv:2408.09869).

Get started

Install Docling

Install via pip or your preferred package manager

pip install docling

Convert a document

Run your first conversion with 3 lines of code

from docling.document_converter import DocumentConverter

converter = DocumentConverter()
result = converter.convert("document.pdf")
print(result.document.export_to_markdown())

Explore capabilities

Check out usage guides and examples to learn more

Next steps

Installation guide

Detailed installation instructions for all platforms

Quick start tutorial

Complete walkthrough with working examples

Join the community

Get help and share your projects on Discord

GitHub repository

Star the project and contribute on GitHub

Get Started

Core Concepts

Usage Guides

Advanced Features

Integrations

Introduction to Docling

What is Docling?

Installation

Quick start

Core concepts

Usage guides

Key features

Multi-format support

Advanced PDF understanding

Unified document representation

AI framework integrations

LangChain

LlamaIndex

Haystack

Crew AI

Privacy and security

What’s new

Use cases

Technical foundation

Get started

Next steps

Installation guide

Quick start tutorial

Join the community

GitHub repository

Build docs developers (and LLMs) love

Get Started

Core Concepts

Usage Guides

Advanced Features

Integrations

​What is Docling?

Installation

Quick start

Core concepts

Usage guides

​Key features

​Multi-format support

​Advanced PDF understanding

​Unified document representation

​AI framework integrations

LangChain

LlamaIndex

Haystack

Crew AI

​Privacy and security

​What’s new

​Use cases

​Technical foundation

​Get started

​Next steps

Installation guide

Quick start tutorial

Join the community

GitHub repository

Build docs developers (and LLMs) love

What is Docling?

Key features

Multi-format support

Advanced PDF understanding

Unified document representation

AI framework integrations

Privacy and security

What’s new

Use cases

Technical foundation

Get started

Next steps