Skip to main content
Beyond the major AI frameworks, Docling integrates with a rich ecosystem of tools and platforms for document processing, data preparation, knowledge graphs, and enterprise solutions.

Knowledge Graphs & Semantic AI

Semantica

Semantica Docling is natively integrated with Semantica, an open-source framework for building semantic layers and knowledge graphs from unstructured data. By combining Docling’s high-fidelity parsing with Semantica’s knowledge engineering, you can transform complex documents into AI-ready, structured knowledge for GraphRAG and multi-agent systems. Key Features:
  • Native DoclingParser integration
  • Knowledge Graph & RDF Triplet construction
  • Entity normalization & deduplication
  • Automated ontology generation
Example:
from semantica.parse import DoclingParser
from semantica.semantic_extract import TripletExtractor

# Parse with Docling
parser = DoclingParser(enable_ocr=True)
result = parser.parse("earnings_call.pdf")

# Extract semantic triplets
extractor = TripletExtractor()
triplets = extractor.extract_triplets(result["full_text"])

Documentation

Semantica docs

GitHub

Source code

Example

Earnings analysis notebook

PyPI

Install package

Cloud Platforms

Apify

Run Docling Actor on Apify Run Docling in the cloud without installation using the Docling Actor on the Apify platform. Example:
apify call vancura/docling -i '{
  "options": {
    "to_formats": ["md", "json", "html", "text", "doctags"]
  },
  "http_sources": [
    {"url": "https://arxiv.org/pdf/2408.09869"}
  ]
}'
Benefits:
  • No local installation required
  • Cloud-based processing
  • Multiple output formats
  • API and CLI access

Run on Apify

Docling Actor

Documentation

Actor docs

Performance & Optimization

Metaxy

Metaxy Combine Docling with Metaxy, a framework for optimizing compute-intensive workloads. Teams can save compute costs while accelerating experimentation for multimodal and HPC-oriented document processing workloads.

Documentation

Metaxy docs

Example

Docling + Metaxy example

Walkthrough

HPC walkthrough

Slides

Presentation

Low-Code & Visual Platforms

Langflow

Docling is available on Langflow, a visual low-code platform for building AI applications.

Documentation

Langflow Docling docs

Video Tutorial

Video guide

GitHub

Langflow repository

Kotaemon

Docling is available in Kotaemon as the DoclingReader loader for RAG applications.

Documentation

DoclingReader docs

Setup Guide

Configuration

GitHub

Kotaemon repository

Data Processing & Preparation

Data Prep Kit

Docling is used by the Data Prep Kit for preparing unstructured data at scale, from laptop to datacenter. Components:
  • PDF to Parquet - Batch document conversion
  • Document Chunking - Intelligent text chunking

PDF2Parquet

PDF ingestion docs

Doc Chunking

Chunking docs

GitHub

Data Prep Kit repository

NLP & Text Processing

spaCy Layout

Docling is available in spaCy as the spaCy Layout plugin for NLP pipelines.

Documentation

SpacyLayout docs

Blog Post

Announcement

GitHub

Source code

PyPI

Install package

txtai

Docling is available as a text extraction backend for txtai.

Documentation

txtai docs

Integration

Docling backend

GitHub

txtai repository

Annotation & Labeling

Prodigy

Docling is available in Prodigy as a Prodigy-PDF plugin recipe for document annotation.

Prodigy Home

Prodigy platform

PDF Plugin

Plugin docs

Recipe

pdf-spans.manual recipe

Blog Post

Announcement

Enterprise & AI Platforms

InstructLab

Docling powers document processing in InstructLab, enabling knowledge extraction for AI model fine-tuning.

InstructLab Home

Platform overview

Documentation

InstructLab docs

UI

InstructLab UI

Blog Post

Red Hat announcement

NVIDIA AI Blueprints

Docling powers NVIDIA’s PDF to Podcast agentic AI blueprint.

Blueprint

PDF to Podcast

GitHub

Source code

Announcement

NVIDIA News

Blog

Agentic AI blog

Additional Integrations

Docling is also used in:
  • Red Hat AI - Enterprise AI platform
  • Cloudera - Enterprise data platform
  • Quarkus - Java framework integration
  • Bee Agent - Agent framework
  • Arconia - Document platform
  • DocETL - Document ETL pipelines
  • Hector - Document processing
  • OpenContracts - Contract analysis
  • OpenWebUI - Web interface
  • Vectara - Neural search platform
For the complete list of integrations, visit the Docling integrations directory.

Integration Ecosystem

Docling’s rich integration ecosystem enables:
1

Flexible Deployment

Use Docling locally, in the cloud, or at datacenter scale
2

Framework Choice

Work with your preferred AI framework or platform
3

Specialized Workflows

Leverage domain-specific tools for your use case
4

Production Ready

Deploy with enterprise-grade platforms and tools

Next Steps

Build docs developers (and LLMs) love