Knowledge Graphs & Semantic AI
Semantica
Docling is natively integrated with Semantica, an open-source framework for building semantic layers and knowledge graphs from unstructured data.
By combining Docling’s high-fidelity parsing with Semantica’s knowledge engineering, you can transform complex documents into AI-ready, structured knowledge for GraphRAG and multi-agent systems.
Key Features:
- Native
DoclingParserintegration - Knowledge Graph & RDF Triplet construction
- Entity normalization & deduplication
- Automated ontology generation
Documentation
Semantica docs
GitHub
Source code
Example
Earnings analysis notebook
PyPI
Install package
Cloud Platforms
Apify
Run Docling in the cloud without installation using the Docling Actor on the Apify platform.
Example:
- No local installation required
- Cloud-based processing
- Multiple output formats
- API and CLI access
Run on Apify
Docling Actor
Documentation
Actor docs
Performance & Optimization
Metaxy
Documentation
Metaxy docs
Example
Docling + Metaxy example
Walkthrough
HPC walkthrough
Slides
Presentation
Low-Code & Visual Platforms
Langflow
Docling is available on Langflow, a visual low-code platform for building AI applications.Documentation
Langflow Docling docs
Video Tutorial
Video guide
GitHub
Langflow repository
Kotaemon
Docling is available in Kotaemon as theDoclingReader loader for RAG applications.
Documentation
DoclingReader docs
Setup Guide
Configuration
GitHub
Kotaemon repository
Data Processing & Preparation
Data Prep Kit
Docling is used by the Data Prep Kit for preparing unstructured data at scale, from laptop to datacenter. Components:- PDF to Parquet - Batch document conversion
- Document Chunking - Intelligent text chunking
PDF2Parquet
PDF ingestion docs
Doc Chunking
Chunking docs
GitHub
Data Prep Kit repository
NLP & Text Processing
spaCy Layout
Docling is available in spaCy as the spaCy Layout plugin for NLP pipelines.Documentation
SpacyLayout docs
Blog Post
Announcement
GitHub
Source code
PyPI
Install package
txtai
Docling is available as a text extraction backend for txtai.Documentation
txtai docs
Integration
Docling backend
GitHub
txtai repository
Annotation & Labeling
Prodigy
Docling is available in Prodigy as a Prodigy-PDF plugin recipe for document annotation.Prodigy Home
Prodigy platform
PDF Plugin
Plugin docs
Recipe
pdf-spans.manual recipe
Blog Post
Announcement
Enterprise & AI Platforms
InstructLab
Docling powers document processing in InstructLab, enabling knowledge extraction for AI model fine-tuning.InstructLab Home
Platform overview
Documentation
InstructLab docs
UI
InstructLab UI
Blog Post
Red Hat announcement
NVIDIA AI Blueprints
Docling powers NVIDIA’s PDF to Podcast agentic AI blueprint.Blueprint
PDF to Podcast
GitHub
Source code
Announcement
NVIDIA News
Blog
Agentic AI blog
Additional Integrations
Docling is also used in:- Red Hat AI - Enterprise AI platform
- Cloudera - Enterprise data platform
- Quarkus - Java framework integration
- Bee Agent - Agent framework
- Arconia - Document platform
- DocETL - Document ETL pipelines
- Hector - Document processing
- OpenContracts - Contract analysis
- OpenWebUI - Web interface
- Vectara - Neural search platform
Integration Ecosystem
Docling’s rich integration ecosystem enables:Next Steps
- Explore core integrations
- Learn about document conversion
- Check out example applications
- Visit the Docling GitHub for the latest integrations