Overview
Claude Octopus is a multi-AI orchestration framework that coordinates three AI providers through a plugin architecture for Claude Code. The system turns one model into three, assigning each provider a distinct role with adversarial review and consensus gates.Claude Octopus works with just Claude out of the box. Add Codex or Gemini to unlock multi-AI orchestration features.
Core components
The system is built from four main components that work together:orchestrate.sh
The orchestration engine that coordinates all AI providers and manages workflow execution
Agents & Personas
33 specialized personas that activate automatically based on task requirements
Skills & Workflows
39 commands implementing Double Diamond phases and specialized workflows
Hooks & State
Event-driven hooks and persistent state management for session continuity
Component interaction
Data flow
- User request - Natural language or command triggers a workflow
- Plugin routing - Claude Octopus parses intent and selects appropriate skill
- Orchestration -
orchestrate.shcoordinates provider execution:- Parallel for research and review phases
- Sequential for definition and synthesis
- Adversarial for debate and validation
- Provider execution - External CLIs execute with isolated environments
- Consensus building - 75% agreement threshold enforced at quality gates
- Result synthesis - Claude combines perspectives into unified output
- State persistence - Decisions and context stored in
.octo/and~/.claude-octopus/
Plugin architecture for Claude Code
Claude Octopus integrates with Claude Code through a structured plugin system:Directory structure
Plugin lifecycle
Execution model
Provider coordination
Claude Octopus assigns specific roles to each AI provider:| Provider | Role | Best For |
|---|---|---|
| Codex (OpenAI) | Implementation depth | Code patterns, technical analysis, architecture design |
| Gemini (Google) | Ecosystem breadth | Research synthesis, alternatives, security review |
| Claude (Anthropic) | Orchestration & synthesis | Quality gates, consensus building, strategic synthesis |
Providers run with isolated environments (v8.7.0+) to prevent API key leakage and limit environment variable access.
Execution patterns
Different workflows use different execution patterns:- Parallel
- Sequential
- Adversarial
Discover & Deliver phasesBenefits: Faster execution, diverse perspectives
Quality gates
Quality gates enforce standards between workflow phases: Consensus threshold: 75% (configurable viaCLAUDE_OCTOPUS_QUALITY_THRESHOLD)
Measured by:
- Subtask success rate during development
- Agreement score across providers
- Validation score during review
Security architecture
Environment isolation (v8.7.0+)
External providers run with minimal environment access:Trust markers
Provider outputs are wrapped with trust indicators:Integrity verification
Result files are hashed (SHA-256) and tracked in.integrity-manifest to detect tampering.
Cost transparency
Visual indicators show exactly which providers are active:| Indicator | Meaning | Cost Source |
|---|---|---|
| 🐙 | Claude Octopus multi-AI mode active | Multiple APIs |
| 🔴 | Codex CLI executing | User’s OPENAI_API_KEY |
| 🟡 | Gemini CLI executing | User’s GEMINI_API_KEY |
| 🔵 | Claude subagent processing | Included with Claude Code |
Users can set
OCTOPUS_MAX_COST_USD to enforce spending limits. Workflows abort if estimated cost exceeds the threshold.Performance characteristics
Typical execution times
| Workflow | Duration | Provider Calls |
|---|---|---|
| discover | 30-60s | 2-3 |
| define | 1-2 min | 3-4 |
| develop | 3-7 min | 4-6 |
| deliver | 2-5 min | 3-5 |
| embrace (all 4) | 5-15 min | 12-18 |
| debate | 1-3 min | 3-9 |
Optimization strategies
Smart cost routing (v8.20.0+)
Smart cost routing (v8.20.0+)
Provider router selects cheapest capable provider based on:
- Bayesian trust scoring
- Historical performance
- Cost per token
- Capability matching
Prompt caching
Prompt caching
Repeated workflows reuse cached context:
- System prompts cached
- Project context cached
- Persona definitions cached
Parallel execution
Parallel execution
Research and review phases run providers simultaneously:
- 2x faster than sequential
- Better diversity of perspectives
Version compatibility
Claude Octopus detects Claude Code version and adapts features:| Claude Code Version | Features Unlocked |
|---|---|
| v2.1.12+ | Task management, bash wildcards |
| v2.1.16+ | Fork context, agent field |
| v2.1.32+ | Agent teams, auto memory |
| v2.1.33+ | Persistent memory, hook events |
| v2.1.36+ | Fast Opus 4.6 mode |
| v2.1.50+ | Worktree isolation, agents CLI |
| v2.1.63+ | HTTP hooks, batch command |
Feature detection is automatic. No manual configuration needed.
Next steps
Double Diamond methodology
Learn about the four-phase workflow structure
Multi-AI orchestration
Understand how providers coordinate and reach consensus
Personas
Explore the 33 specialized agent personas
Workflows
Discover workflow patterns and autonomy modes
