POST /api/ocr

The /api/ocr endpoint extracts plain text from invoice images or PDFs without structuring. Use this for debugging, testing model vision capabilities, or when you only need raw text.

Endpoint

POST /api/ocr

Request Body

imageBase64

string

Base64-encoded image data or data URL. Required if pdfUrl and pdfBase64 are not provided.Formats:

Bare base64: iVBORw0KGgoAAAANS...
Data URL: data:image/png;base64,iVBORw0KGgo...

mimeType

string

default:"image/png"

MIME type of the image (e.g., image/png, image/jpeg, image/webp). Only used if imageBase64 is bare base64.

pdfUrl

string

Publicly accessible URL to a PDF document. Required if imageBase64 and pdfBase64 are not provided.

pdfBase64

string

Base64-encoded PDF data or data URL. Required if imageBase64 and pdfUrl are not provided.Formats:

Bare base64: JVBERi0xLjQKJeLjz9...
Data URL: data:application/pdf;base64,JVBERi0xLjQK...

filename

string

default:"document.pdf"

Optional filename for PDF documents (used in OpenRouter request)

model

string

Override the default model. Examples: openai/gpt-4o, google/gemini-2.0-flash, anthropic/claude-3.5-sonnet

plugins

array

Override PDF parsing plugins. Advanced use only.Default:

[
  {
    "id": "file-parser",
    "pdf": { "engine": "pdf-text" }
  }
]

annotations

object

Pass-through OpenRouter annotations to avoid re-parsing costs on subsequent requests

Response

text

string

Extracted raw text from the document, preserving line breaks

Error Response

error

string

Error message describing what went wrong

Example Requests

curl -X POST https://your-domain.com/api/ocr \
  -H "Content-Type: application/json" \
  -d '{
    "imageBase64": "data:image/png;base64,iVBORw0KGgoAAAANS...",
    "model": "openai/gpt-4o-mini"
  }'

Example Response

Success (200)

{
  "text": "TAX INVOICE\n\nInvoice No: INV-2024-001\nDate: 15-01-2024\n\nBill To:\nAcme Corporation\n123 Business Street\nMumbai, Maharashtra 400001\nGSTIN: 27AABCU9603R1ZM\n\nItems:\n1. Premium Widget - Qty: 10 - Rate: ₹1,000 - Amount: ₹10,000\n   HSN: 8517 - GST 18% (CGST 9% + SGST 9%)\n\nSubtotal: ₹10,000\nCGST (9%): ₹900\nSGST (9%): ₹900\nTotal: ₹11,800"
}

Error (400)

{
  "error": "Provide 'imageBase64' or 'pdfUrl' or 'pdfBase64'"
}

Error (500)

{
  "error": "OpenRouter error: 401 Invalid API key"
}

Implementation Details

System Prompt

The endpoint uses this system prompt to guide text extraction:

You are an OCR extractor. Return only the raw, verbatim text from the image. 
Preserve line breaks. Do not add commentary or extra labels.

PDF Processing

For PDFs, the endpoint:

Configures the file-parser plugin with the specified engine
Sends the PDF to OpenRouter with multi-page support
Extracts text across all pages

The OPENROUTER_PDF_ENGINE environment variable controls the parsing engine:

pdf-text (default) - Fast, text-layer extraction
mistral-ocr - OCR-based, better for scanned PDFs
native - Model’s native PDF support

Model Parameters

temperature: 0 - Deterministic output
response_format - Not enforced (plain text response)

Use Cases

Debugging

Verify what the model sees before attempting structured extraction

Format Testing

Test if your invoice format is readable by vision models

Search Indexing

Extract text for full-text search or keyword indexing

Custom Parsing

Get raw text and apply your own parsing logic

Limitations

This endpoint returns unstructured text. For structured invoice data, use /api/ocr-structured or /api/ocr-structured-v4.

No schema validation or data structuring
No reconciliation or calculation verification
Line breaks and formatting depend on model interpretation
Multi-column layouts may not preserve column order

Next Steps

Structured Extraction

Extract structured JSON with MyBillBook schema

V4 Schema

Advanced India GST extraction with reconciliation

Endpoints

Schemas

Endpoint

Request Body

Response

Error Response

Example Requests

Example Response

Implementation Details

System Prompt

PDF Processing

Model Parameters

Use Cases

Debugging

Format Testing

Search Indexing

Custom Parsing

Limitations

Next Steps

Structured Extraction

V4 Schema

Build docs developers (and LLMs) love

Endpoints

Schemas

​Endpoint

​Request Body

​Response

​Error Response

​Example Requests

​Example Response

​Implementation Details

​System Prompt

​PDF Processing

​Model Parameters

​Use Cases

Debugging

Format Testing

Search Indexing

Custom Parsing

​Limitations

​Next Steps

Structured Extraction

V4 Schema

Build docs developers (and LLMs) love

Endpoint

Request Body

Response

Error Response

Example Requests

Example Response

Implementation Details

System Prompt

PDF Processing

Model Parameters

Use Cases

Limitations

Next Steps