[defaults.routing] and override per agent in [agents.routing].
Process-Type Routing
Model used by channel processes (user-facing conversations).
Model used by branch processes (forked thinking).
Model used by worker processes (task execution).
Model used by compactor processes (context summarization).
Model used by cortex processes (system observation, bulletin generation).
Model used for voice processing (if enabled).
Task-Type Overrides
Task-specific model overrides for workers and branches. When a worker or branch is spawned with a specific task type, use the override model instead of the default.
Thinking Effort
Thinking effort level for channel models. Controls extended thinking tokens.Valid values:
auto, low, medium, highThinking effort level for branch models.
Thinking effort level for worker models.
Thinking effort level for compactor models.
Thinking effort level for cortex models.
Fallback Chains
Fallback chains for resilience. When a model fails with a retriable error (429, 502, 503, 504, rate limit), try the next model in its chain.
How long to deprioritize a rate-limited model (seconds). After a rate limit error, the model is avoided for this duration.
Model Name Format
Model names follow the formatprovider/model-id. The provider must match a configured LLM provider or custom provider.
Built-in Providers
anthropic/- Anthropic models (e.g.,anthropic/claude-sonnet-4)openai/- OpenAI models (e.g.,openai/gpt-4.1)openrouter/- OpenRouter models (e.g.,openrouter/anthropic/claude-sonnet-4-20250514)kilo/- Kilo Gateway models (e.g.,kilo/anthropic/claude-sonnet-4.5)gemini/- Google Gemini models (e.g.,gemini/gemini-2.0-flash-exp)groq/- Groq models (e.g.,groq/llama-3.3-70b-versatile)deepseek/- DeepSeek models (e.g.,deepseek/deepseek-chat)xai/- xAI models (e.g.,xai/grok-2-latest)mistral/- Mistral AI models (e.g.,mistral/mistral-large-latest)together/- Together AI modelsfireworks/- Fireworks AI modelsnvidia/- NVIDIA NIM modelsollama/- Ollama models (e.g.,ollama/llama3.3)
Custom Providers
For custom providers defined in[llm.provider.<name>], use <name>/model-id:
Examples
Single Provider (Anthropic)
Mixed Providers with Task Overrides
OpenRouter with Fallbacks
Per-Agent Routing Override
Thinking Effort Configuration
Retriable Errors
The following errors trigger fallback to the next model in the chain:- HTTP 429 (rate limit)
- HTTP 502 (bad gateway)
- HTTP 503 (service unavailable)
- HTTP 504 (gateway timeout)
- Connection errors
- Timeouts
- Empty responses
- Malformed responses