Working with workflows
Claude Octopus structures work using the Double Diamond methodology—four phases that move from divergent exploration to convergent delivery. This guide covers workflow progression, quality gates, and choosing the right workflow.The Double Diamond
Adapted from the UK Design Council’s framework, the Double Diamond ensures quality through structured phases:Four phases
Discover (Probe)
Purpose: Divergent research and explorationActivities:
- Multi-provider research (Codex + Gemini + Claude)
- Broad ecosystem analysis
- Technology comparison
- Best practices research
- Community insights
/octo:discover or /octo:probeVisual indicator: 🐙 🔍Define (Grasp)
Purpose: Convergent consensus buildingActivities:
- Synthesize research findings
- Build consensus on approach
- Define requirements clearly
- Identify constraints
- Establish success criteria
/octo:define or /octo:graspVisual indicator: 🐙 🎯Develop (Tangle)
Purpose: Divergent implementationActivities:
- Multi-provider code generation
- Implementation with quality gates
- Testing and validation
- Security review
- Performance optimization
/octo:develop or /octo:tangleVisual indicator: 🐙 🛠️Running individual phases vs full workflows
Individual phases
Run phases individually for maximum control:- You want to review output before proceeding
- Requirements may change between phases
- High-stakes features requiring oversight at each step
- Learning or experimenting with the methodology
Full workflow (Embrace)
Run all 4 phases automatically:- Discover: Multi-provider research
- Define: Consensus building on approach
- Develop: Implementation with quality gates
- Deliver: Final validation and review
- Clear requirements from the start
- Trusted, well-understood features
- Autonomous mode enabled (see below)
- You want end-to-end workflow without interruptions
Autonomy modes
Configure how much oversight you want during embrace workflows:- Supervised (Default)
- Semi-Autonomous
- Autonomous
Approval required after each phase
- Maximum control and oversight
- Review synthesis before proceeding to next phase
- Best for critical features or learning
Choosing the right workflow
Claude Octopus provides specialized workflows beyond the Double Diamond phases.Workflow decision tree
I want to research a topic thoroughly
I want to research a topic thoroughly
Use:
/octo:research or /octo:discoverWhat you get:- Multi-AI research (Codex + Gemini + Claude)
- Comprehensive analysis of options
- Trade-off evaluation
- Best practice identification
I want to debate two approaches
I want to debate two approaches
Use:
/octo:debateWhat you get:- Structured three-way AI debate
- Technical perspective (Codex)
- Ecosystem perspective (Gemini)
- Moderator and synthesis (Claude)
- Consensus score
I want to build a feature end-to-end
I want to build a feature end-to-end
Use:
/octo:embraceWhat you get:- Full 4-phase workflow
- Quality gates between phases
- Multi-AI perspectives throughout
- Configurable autonomy
I want to review existing code
I want to review existing code
Use:
/octo:reviewWhat you get:- Multi-AI code review
- Security vulnerability detection
- 4-dimension scoring (correctness, security, performance, maintainability)
- Best practices enforcement
I want to write tests first, then code
I want to write tests first, then code
Use:
/octo:tddWhat you get:- Red-green-refactor discipline
- Tests written before implementation
- Incremental feature development
- Continuous validation
I want to scan for vulnerabilities
I want to scan for vulnerabilities
Use:
/octo:securityWhat you get:- OWASP Top 10 vulnerability scanning
- Authentication/authorization review
- Input validation checks
- Red team analysis
I want to go from spec to shipping code
I want to go from spec to shipping code
Use:
/octo:factoryWhat you get:- Autonomous spec-to-software pipeline
- Holdout testing (80/20 split)
- Satisfaction scoring
- PASS/WARN/FAIL verdict
I want to debug a tricky issue
I want to debug a tricky issue
Use:
/octo:debugWhat you get:- Systematic debugging
- Evidence gathering
- Root cause identification
- Fix with verification
I just want to run something quick
I just want to run something quick
Use:
/octo:quickWhat you get:- Lightweight, single-phase execution
- No multi-AI overhead
- Fast results
Workflow progression and quality gates
Quality gates ensure sloppy work doesn’t advance to the next phase.Quality gate thresholds
Discover
Gate: All providers responded successfullyChecks:
- Codex CLI returned valid synthesis
- Gemini CLI returned valid synthesis
- Claude synthesis completed
- Retry with timeout increase
- Proceed with available providers
- User review in semi-autonomous mode
Define
Gate: Consensus achieved (75%+ agreement)Checks:
- Requirements clearly defined
- Constraints identified
- Success criteria established
- 75% consensus across providers
- Re-run define with clarifying questions
- User review required
Develop
Gate: Security, performance, best practices validatedChecks:
- No critical security issues
- Performance within acceptable range
- Best practices followed
- Tests written and passing
- Remediation with context from validation report
- Re-run develop phase
- User review in semi-autonomous mode
Deliver
Gate: Final quality certification passedChecks:
- All acceptance criteria met
- No blocking issues
- Documentation complete
- Go/no-go recommendation
- Provide detailed failure report
- Suggest remediation steps
- User decision on next steps
75% consensus threshold
The develop (tangle) phase requires 75% consensus across AI providers before advancing: Example:Examples from real use cases
Use case 1: API authentication research
Goal: Research OAuth 2.0 vs JWT authentication for a new API Workflow:- Codex analysis: Technical implementation details, security considerations
- Gemini analysis: Ecosystem adoption, library support, community insights
- Claude synthesis: Comparison table, recommendations based on use case
Use case 2: End-to-end feature development
Goal: Build complete user authentication system from research to delivery Workflow:- Discover (research): OAuth patterns, JWT, session management → Approved
- Define (consensus): OAuth 2.0 + JWT + refresh tokens → 82% consensus → Approved
- Develop (implementation): Auth endpoints, token generation, validation → Security validated → Approved
- Deliver (validation): Code review passed, security scan clean → Go recommendation → Shipped
- All 4 gates passed
- No security issues found
- Performance within acceptable range (< 200ms token validation)
Use case 3: Architectural decision debate
Goal: Decide between monorepo and microservices architecture Workflow:- Round 1: Opening arguments (Codex: microservices, Gemini: monorepo, Claude: moderates)
- Round 2: Rebuttals and counterarguments
- Round 3: Final synthesis and consensus
Use case 4: Security audit and remediation
Goal: Audit authentication module for OWASP vulnerabilities Workflow:- 2 critical issues (JWT secret hardcoded, no rate limiting)
- 3 medium issues (weak password validation, missing CSRF protection)
- Remediation applied with adversarial review
- Final scan: 0 critical issues
Use case 5: Spec-to-software pipeline
Goal: Build CLI tool from specification with holdout testing Workflow:- Parse spec → extracted 12 behaviors
- Generate scenarios → 30 test scenarios created
- Split holdout → 24 training, 6 blind scenarios
- Embrace workflow → full implementation
- Holdout tests → 5/6 passed (83% holdout accuracy)
- Score satisfaction → 0.87 composite score
- Report → WARN verdict (below 0.90 target)
- Reviewed failed holdout scenario
- Re-ran factory with refined spec
- Second run: 0.92 composite score → PASS
Next steps
Using commands
Learn command structure, the smart router, and command composition
Configuring providers
Set up Codex, Gemini, and configure provider selection
Configuration
Environment variables, autonomy modes, and custom hooks
Architecture
Deep dive into the Double Diamond methodology
