Overview
CheckThat AI supports 18+ state-of-the-art AI models from 5 major providers. All models support claim normalization, with most supporting structured outputs, refinement, and evaluation.Model routing is handled automatically by the
LLMRouter class in api/_utils/LLMRouter.py, which detects the provider and initializes the appropriate API client.Supported Providers
OpenAI
GPT-5, GPT-4, o3, o4-miniBest for: Production use, balanced performance
Anthropic
Claude Sonnet 4, Claude Opus 4.1Best for: High-quality claims, nuanced understanding
Gemini 2.5 Pro, Gemini 2.5 FlashBest for: Fast inference, cost-effectiveness
xAI
Grok 3, Grok 4, Grok 3 MiniBest for: Real-time claims, current events
Meta (via Together AI)
Llama 3.3 70B TurboBest for: Open-source, customization
DeepSeek (via Together AI)
DeepSeek R1 Distill Llama 70BBest for: Research, reasoning tasks
Complete Model List
OpenAI Models
Display Name: GPT-5OpenAI’s latest flagship model with enhanced reasoning and claim understanding.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Display Name: GPT-5 nanoEfficient version of GPT-5 optimized for speed and cost.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Display Name: o3OpenAI’s reasoning model with advanced chain-of-thought capabilities.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ⚠️ Streaming (limited)
Display Name: o4-miniCompact reasoning model balancing quality and efficiency.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ⚠️ Streaming (limited)
Anthropic Models
Display Name: Claude Sonnet 4Anthropic’s balanced model with excellent claim understanding.Capabilities:
- ✅ Claim Normalization
- ⚠️ Structured Outputs (Pydantic only)
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Display Name: Claude Opus 4.1Anthropic’s most capable model with superior reasoning.Capabilities:
- ✅ Claim Normalization
- ❌ Structured Outputs (not yet supported)
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Google Models
Display Name: Gemini 2.5 ProGoogle’s flagship model with strong factual grounding.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs (JSON Schema)
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Display Name: Gemini 2.5 FlashFast, efficient Gemini model optimized for speed.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs (JSON Schema)
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
xAI Models (Grok)
Display Name: Grok 3xAI’s base model with real-time knowledge integration.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Display Name: Grok 4xAI’s latest model with enhanced reasoning and real-time web access.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Display Name: Grok 3 MiniCompact Grok model for efficient processing.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs
- ✅ Refinement
- ✅ Evaluation (G-Eval)
- ✅ Streaming
Together AI Models
Display Name: Llama 3.3 70BMeta’s open-source flagship model with strong instruction following.Capabilities:
- ✅ Claim Normalization
- ✅ Structured Outputs (JSON Schema)
- ✅ Refinement
- ❌ Evaluation (not supported by DeepEval)
- ✅ Streaming
Display Name: DeepSeek R1 Distill Llama 70BDeepSeek’s reasoning-enhanced Llama distillation.Capabilities:
- ✅ Claim Normalization
- ⚠️ Structured Outputs (limited)
- ✅ Refinement
- ❌ Evaluation (not supported by DeepEval)
- ✅ Streaming
Model Comparison Table
| Model | Provider | Quality | Speed | Cost | Best For |
|---|---|---|---|---|---|
| GPT-5 | OpenAI | ⭐⭐⭐⭐⭐ | ⚡⚡⚡⚡ | $$$ | Production, high-stakes |
| Claude Opus 4.1 | Anthropic | ⭐⭐⭐⭐⭐ | ⚡⚡⚡ | $$$ | Complex claims, quality |
| Claude Sonnet 4 | Anthropic | ⭐⭐⭐⭐ | ⚡⚡⚡⚡ | $$ | Balanced, production |
| Gemini 2.5 Pro | ⭐⭐⭐⭐ | ⚡⚡⚡⚡⚡ | $$ | Factual consistency | |
| Gemini 2.5 Flash | ⭐⭐⭐ | ⚡⚡⚡⚡⚡ | $ | High volume, speed | |
| Grok 4 | xAI | ⭐⭐⭐⭐ | ⚡⚡⚡⚡ | $$ | Current events |
| GPT-5 nano | OpenAI | ⭐⭐⭐⭐ | ⚡⚡⚡⚡⚡ | $$ | Cost-effective quality |
| Llama 3.3 70B | Together AI | ⭐⭐⭐ | ⚡⚡⚡⚡ | Free | Open-source, research |
API Configuration
Automatic Model Routing
CheckThat AI automatically routes requests to the correct provider:From api/_utils/LLMRouter.py:14-43
Model Constants
From api/_types.py:4-17
Using Different Models
Structured Output Support
Models with structured output support can return JSON according to a schema:From api/_types.py:42-55
Provider-Specific Features
OpenAI
From api/_utils/openai.py:18-29
- Native structured outputs via
response_format - Streaming support
- Function calling
- JSON mode
- Vision capabilities (GPT-4o, GPT-5)
Anthropic
From api/_utils/anthropic.py:21-32
- Streaming support
- System message separation
- Instructor integration for Pydantic models
- OpenAI-compatible response formatting
Google (Gemini)
From api/_utils/gemini.py:19-33
- Native JSON schema support
- System instructions
- Streaming support
- Safety ratings
- Multimodal capabilities
xAI (Grok)
From api/_utils/xai.py:20-32
- OpenAI-compatible API
- Real-time web access
- Streaming support
- JSON schema support
Together AI
From api/_utils/togetherAI.py:19-30
- Free tier for Llama and DeepSeek models
- OpenAI-compatible responses
- JSON schema support
- Streaming support
Model Selection Guide
- By Use Case
- By Budget
- By Latency
Production Fact-Checking
- Primary: Claude Sonnet 4, GPT-5 nano
- Backup: Gemini 2.5 Pro
- Primary: GPT-5, Claude Opus 4.1
- Backup: Gemini 2.5 Pro
- Primary: Gemini 2.5 Flash
- Backup: GPT-5 nano, Grok 3 Mini
- Primary: Grok 4
- Backup: Grok 3, GPT-5
- Primary: Llama 3.3 70B (free)
- Backup: Gemini 2.5 Flash, GPT-5 nano
- Primary: DeepSeek R1, Llama 3.3 70B
- Backup: o3, Claude Opus 4.1
Best Practices
Start with Balanced Models
Begin with Claude Sonnet 4 or GPT-5 nano for the best balance of quality, speed, and cost.
Use Faster Models for Refinement
If using refinement, consider faster models (Gemini Flash, GPT-5 nano) to reduce latency.
Reserve Premium Models for Hard Cases
Use GPT-5 or Claude Opus only for complex claims that require maximum quality.
Next Steps
Claim Normalization
Learn how models normalize claims from social media posts
Refinement Pipeline
See how models iteratively improve claim quality
Evaluation Metrics
Understand how models evaluate claim quality
API Reference
Start using models via the CheckThat AI API