Document Intelligence
Azure Document Intelligence is a cloud-based AI service for extracting information from documents using OCR and machine learning. Transform documents into intelligent data-driven solutions by automatically extracting text, tables, structure, and key-value pairs.What is Document Intelligence?
Document Intelligence uses machine-learning models to extract and analyze:- Text: Printed and handwritten content
- Structure: Tables, sections, and layout
- Key-value pairs: Form fields and their values
- Entities: Specific data like dates, amounts, names
Core Capabilities
Document Analysis
Extract text, tables, and structure from any document
Prebuilt Models
Ready-to-use models for invoices, receipts, IDs, and more
Custom Models
Train models for your specific document types
Document Analysis Models
General-purpose models for extracting content from documents:Read Model
Extract text from documents:- Printed and handwritten text extraction
- Multi-language support
- High accuracy OCR
- Text with position information
- Searchable PDF output
Layout Model
Extract text, tables, and document structure:- Text extraction
- Table detection and extraction
- Section headers
- Paragraphs and reading order
- Selection marks (checkboxes)
- Barcodes and QR codes
Prebuilt Models
Pre-trained models for common document types - no training required:Financial Documents
- Invoice
- Receipt
- Bank Statement
Extract key information from invoices:
- Vendor details (name, address, tax ID)
- Customer information
- Invoice number and date
- Line items with quantities and amounts
- Subtotals and tax amounts
- Total amount due
Identity Documents
ID Cards and Passports
ID Cards and Passports
Extract from driver’s licenses, passports, and ID cards:
- First and last name
- Date of birth
- Document number
- Expiration date
- Address
- Country/region
- Machine readable zone (MRZ)
- U.S. driver’s licenses
- U.S. passports
- International passports
- National ID cards
Health Insurance Card
Health Insurance Card
Extract from U.S. health insurance cards:
- Insurer name
- Member name and ID
- Group number
- Dependent information
- Prescription information
- Medicare/Medicaid ID
Tax Documents
Prebuilt models for U.S. tax forms:- W-2: Wage and tax statement
- 1098: Mortgage interest statement
- 1099: Income forms (all variations)
- 1040: Individual tax return (all variations)
- Unified Tax Model: Automatically detect and process any supported tax form
Mortgage Documents
Models for mortgage loan processing:- 1003 URLA: Uniform Residential Loan Application
- 1004 URAR: Uniform Residential Appraisal Report
- 1005: Verification of Employment
- 1008: Uniform Underwriting and Transmittal Summary
- Closing Disclosure: Final loan terms and costs
Custom Models
Train models on your specific document types when prebuilt models don’t fit:Custom Template Model
For structured documents with consistent layouts:- Fixed form templates
- Consistent field positions
- 5+ sample documents needed
- Fast training time
- High accuracy for fixed layouts
Custom Neural Model
For unstructured or varying layouts:- Variable document structures
- Handwritten content
- Mixed document types
- 100+ sample documents recommended
- Longer training time
- Handles layout variations
Custom Classifier
Classify documents into categories:- Identify document types
- Route to appropriate model
- Process mixed document batches
- 5+ samples per class needed
Composed Models
Combine multiple custom models:- Group related document types
- Single endpoint for multiple forms
- Automatic model selection
- Simplify API calls
Add-on Capabilities
Optional features to enhance extraction:- High Resolution Extraction: Better accuracy for small text
- Formula Extraction: Extract mathematical formulas
- Font Property Extraction: Identify fonts and styling
- Barcode Extraction: Read 1D and 2D barcodes
- Query Fields: Extract specific information using natural language
- Key-Value Pairs: Find form fields automatically
Development Options
Document Intelligence Studio
Web-based tool for testing and labeling documents
REST API
Direct HTTP API access for any programming language
Python SDK
C# SDK
Java SDK
Maven dependency for Java applications
JavaScript SDK
Use Cases
Accounts Payable
- Automated invoice processing
- Extract vendor and payment details
- Integrate with accounting systems
- Reduce manual data entry
Tax Processing
- Extract data from tax forms
- Automate tax return preparation
- Process W-2s and 1099s
- Verify tax document accuracy
Identity Verification
- KYC compliance
- Extract ID information
- Verify identity documents
- Automate onboarding
Mortgage Processing
- Process loan applications
- Extract appraisal data
- Verify employment
- Analyze closing documents
Input Requirements
- Formats: PDF, JPEG, PNG, BMP, TIFF, HEIF
- File Size: Up to 500 MB (2 GB for PDF)
- Pages: Up to 2,000 pages
- Resolution: 50 x 50 to 10,000 x 10,000 pixels
Region Availability
Document Intelligence is available in these Azure regions:- East US
- West US 2
- West Europe
- North Europe
- And more regions
Getting Started
Try Document Intelligence Studio
Test prebuilt models with sample documents at documentintelligence.ai.azure.com
Pricing
- Free Tier (F0): 500 pages per month
- Standard Tier (S0): Pay per page analyzed
- Custom model training: Additional charges
- Storage: Required for training data