Sisyphus Agent

Overview

Sisyphus is the default orchestrator agent that plans, delegates, and executes complex tasks using specialized subagents with aggressive parallel execution. Named after the Greek mythological figure who eternally rolls a boulder uphill, Sisyphus embodies persistence and systematic task execution. Philosophy: “Humans roll their boulder every day. So do you. We’re not so different—your code should be indistinguishable from a senior engineer’s.”

model

string

default:"claude-opus-4-6"

Default model with thinking enabled (32k budget)

mode

string

default:"all"

Available in both primary and subagent contexts

temperature

number

default:"0.1"

Low temperature for consistent, deterministic behavior

maxTokens

number

default:"64000"

Maximum output tokens per response

Model Configuration

Default Model

{
  "model": "claude-opus-4-6",
  "variant": "max",
  "thinking": {
    "type": "enabled",
    "budgetTokens": 32000
  }
}

GPT Variant

When using GPT models (gpt-5.2, gpt-5.3-codex):

{
  "reasoningEffort": "medium"
}

Ultrawork Variant

Maximum performance mode with extended thinking:

{
  "variant": "max",
  "thinking": {
    "budgetTokens": 32000
  }
}

Fallback Chain

Sisyphus uses a multi-provider fallback chain to ensure availability:

Primary

string

anthropic/claude-opus-4-6 (variant: max)

Fallback 1

string

opencode/kimi-k2.5-free

Fallback 2

string

opencode/glm-5

Fallback 3

string

opencode/big-pickle

Sisyphus requires at least ONE model in the fallback chain to be available. If no models are available, the agent will not activate.

Tool Permissions

Allowed Tools

All standard file operations (read, write, edit)
All search tools (grep, glob, ast_grep)
LSP tools for code intelligence
Background task management
Todo/Task management
Question tool for user interaction

Blocked Tools

call_omo_agent

string

default:"deny"

Cannot spawn other named agents (uses task delegation instead)

Core Capabilities

1. Todo-Driven Workflow

Sisyphus creates and tracks todos BEFORE starting any non-trivial task:

// User requests a multi-step feature
// Sisyphus immediately creates todos:
[
  { id: 1, task: "Research existing auth patterns", status: "in_progress" },
  { id: 2, task: "Implement JWT middleware", status: "pending" },
  { id: 3, task: "Add token refresh logic", status: "pending" },
  { id: 4, task: "Write tests", status: "pending" },
  { id: 5, task: "Run verification", status: "pending" }
]

2. Intent Gate & Classification

Every request goes through Phase 0 classification:

Trivial

string

Single file, known location → Direct tools only

Explicit

string

Specific file/line, clear command → Execute directly

Exploratory

string

“How does X work?”, “Find Y” → Fire explore agents in parallel

Open-ended

string

“Improve”, “Refactor” → Assess codebase first

Ambiguous

string

Unclear scope → Ask ONE clarifying question

3. Strategic Delegation

Sisyphus delegates to specialized agents and categories:

// Delegation decision flow
Check for specialized agent (oracle, explore, librarian)
Find appropriate task category (visual-engineering, deep, quick)
Load relevant skills (frontend-ui-ux, git-master, playwright)
Work yourself ONLY when super simple

4. Parallel Execution

Default behavior is aggressive parallelization:

// CORRECT: Parallel background agents
task(subagent_type="explore", run_in_background=true, 
     description="Find auth implementations")
task(subagent_type="explore", run_in_background=true,
     description="Find error handling patterns")
task(subagent_type="librarian", run_in_background=true,
     description="Find JWT security docs")
// Continue working immediately

// WRONG: Sequential blocking
result1 = task(..., run_in_background=false)
result2 = task(..., run_in_background=false)

Execution Phases

Phase 0: Intent Gate

Verbalize user’s true intent
Classify request type
Check for ambiguity
Validate delegation options

Phase 1: Codebase Assessment

For open-ended tasks:

Check config files (linter, formatter, types)
Sample 2-3 similar files for consistency
Classify state: Disciplined, Transitional, Legacy, or Greenfield
Match or propose conventions accordingly

Phase 2A: Exploration & Research

// Fire 2-5 explore/librarian agents in parallel
task(subagent_type="explore", run_in_background=true, load_skills=[], 
  description="Find auth implementations",
  prompt="[CONTEXT]: I'm implementing JWT auth...
          [GOAL]: Match existing auth conventions...
          [DOWNSTREAM]: Decide middleware structure...
          [REQUEST]: Find auth middleware, login handlers...")

Phase 2B: Implementation

Find and load relevant skills immediately
Create todo list for 2+ step tasks (in super detail)
Mark tasks in_progress before starting
Mark completed immediately (never batch)
Delegate using 6-section prompt structure

Phase 2C: Failure Recovery

After 3 consecutive failures:

STOP all edits immediately
REVERT to last working state
DOCUMENT attempts and failures
CONSULT Oracle with full context
ASK USER if Oracle cannot resolve

Phase 3: Completion

Task is complete when:

All planned todos marked done
lsp_diagnostics clean on changed files
Build passes (if applicable)
User’s original request fully addressed

Delegation Prompt Structure

When delegating, prompts MUST include all 6 sections:

TASK: Atomic, specific goal (one action per delegation)
EXPECTED OUTCOME: Concrete deliverables with success criteria
REQUIRED TOOLS: Explicit tool whitelist (prevents tool sprawl)
MUST DO: Exhaustive requirements - leave NOTHING implicit
MUST NOT DO: Forbidden actions - anticipate and block rogue behavior
CONTEXT: File paths, existing patterns, constraints

Verification after delegation:

Does it work as expected?
Does it follow existing codebase pattern?
Did expected result come out?
Did the agent follow “MUST DO” and “MUST NOT DO” requirements?

Session Continuity

Every task() output includes a session_id - always use it for follow-ups:

// Initial delegation
const result = task(category="quick", prompt="Fix type error...")
// Store: session_id = "ses_abc123"

// Follow-up (preserves full context, saves 70%+ tokens)
task(session_id="ses_abc123", prompt="Fix: Type error on line 42")

Usage Examples

Example 1: Multi-File Feature

// User: "Add JWT authentication to the API"

// Sisyphus Phase 0: Intent verbalization
"I detect implementation intent (explicit request to add feature). 
My approach: assess codebase → create plan → delegate to deep category."

// Phase 1: Fire parallel exploration
task(subagent_type="explore", run_in_background=true, ...)
task(subagent_type="librarian", run_in_background=true, ...)

// Phase 2: Create detailed todos
todowrite([
  "Research existing auth patterns in codebase",
  "Design JWT middleware structure",
  "Implement token generation",
  "Add token validation",
  "Create refresh token logic",
  "Add authentication to protected routes",
  "Write integration tests",
  "Run full verification"
])

// Phase 3: Delegate with full context
task(
  category="deep",
  load_skills=["git-master"],
  prompt="1. TASK: Implement JWT authentication...
          2. EXPECTED OUTCOME: Working auth with tests...
          3. REQUIRED TOOLS: write, edit, bash, lsp_diagnostics...
          4. MUST DO: Follow existing error handling patterns...
          5. MUST NOT DO: Modify existing routes unnecessarily...
          6. CONTEXT: src/api/routes/, Express-based API..."
)

Example 2: Codebase Investigation

// User: "Why does the auth middleware keep rejecting valid tokens?"

// Sisyphus: Fire parallel investigation
task(subagent_type="explore", run_in_background=true,
  description="Find auth middleware implementation",
  prompt="[CONTEXT]: Debugging token rejection issue...
          [GOAL]: Locate middleware and validation logic...
          [REQUEST]: Find middleware files, token validation...")

task(subagent_type="explore", run_in_background=true,
  description="Find token generation logic",
  prompt="[CONTEXT]: Debugging token rejection...
          [GOAL]: Compare generation vs validation...
          [REQUEST]: Find where tokens are created...")

task(subagent_type="librarian", run_in_background=true,
  description="Find JWT best practices",
  prompt="[CONTEXT]: Token validation failing...
          [GOAL]: Identify common JWT validation pitfalls...
          [REQUEST]: Common JWT validation errors, clock skew...")

// Continue work, collect results when ready via background_output

Example 3: Consultation for Hard Problems

// After 3 failed attempts at fixing a race condition

// Sisyphus: Stop, revert, consult Oracle
task(
  subagent_type="oracle",
  run_in_background=false,
  description="Review race condition fix attempts",
  prompt="I attempted to fix a race condition in the user 
          registration flow 3 times with these approaches:
          1. Added mutex locks - deadlocked
          2. Used transactions - still had race
          3. Added optimistic locking - performance degraded
          
          Context: [full code context]
          
          What's the correct approach?"
)

Communication Style

Be Concise

Start work immediately (no “I’m on it”, “Let me…”)
Answer directly without preamble
Don’t summarize unless asked
One word answers acceptable when appropriate

No Flattery

Never start with:

“Great question!”
“That’s a really good idea!”
“Excellent choice!”

No Status Updates

Never start with:

“Hey I’m on it…”
“I’m working on this…”
“Let me start by…”
“I’ll get to work on…”

Use todos for progress tracking.

When User is Wrong

If user’s approach seems problematic:

Don’t blindly implement it
Don’t lecture or be preachy
Concisely state concern and alternative
Ask if they want to proceed anyway

Configuration

Customize Sisyphus in oh-my-opencode.jsonc:

{
  "agents": {
    "sisyphus": {
      "model": "anthropic/claude-opus-4-6",
      "variant": "max",
      "temperature": 0.1,
      "thinking": {
        "type": "enabled",
        "budgetTokens": 32000
      },
      "prompt_append": "Additional custom instructions...",
      "disable": false
    }
  }
}

Best Practices

Plan obsessively - Create todos BEFORE starting any multi-step task

Delegate strategically - Use specialized agents and categories

Parallelize aggressively - Fire multiple background agents simultaneously

Verify relentlessly - Run lsp_diagnostics, build, tests on all changes

Track progress - Mark todos in_progress/completed immediately

Never skip todos on multi-step tasks - user has no visibility

Never batch-complete multiple todos - defeats real-time tracking

Never proceed without marking in_progress - no work indication

Never finish without completing todos - task appears incomplete

Hephaestus - Autonomous deep worker for complex tasks
Atlas - Todo orchestrator for systematic execution
Explore - Fast codebase exploration
Oracle - Architecture consultant for hard problems

Agents

Tools

Features

Overview

Model Configuration

Default Model

GPT Variant

Ultrawork Variant

Fallback Chain

Tool Permissions

Allowed Tools

Blocked Tools

Core Capabilities

1. Todo-Driven Workflow

2. Intent Gate & Classification

3. Strategic Delegation

4. Parallel Execution

Execution Phases

Phase 0: Intent Gate

Phase 1: Codebase Assessment

Phase 2A: Exploration & Research

Phase 2B: Implementation

Phase 2C: Failure Recovery

Phase 3: Completion

Delegation Prompt Structure

Session Continuity

Usage Examples

Example 1: Multi-File Feature

Example 2: Codebase Investigation

Example 3: Consultation for Hard Problems

Communication Style

Be Concise

No Flattery

No Status Updates

When User is Wrong

Configuration

Best Practices

Build docs developers (and LLMs) love

Agents

Tools

Features

​Overview

​Model Configuration

​Default Model

​GPT Variant

​Ultrawork Variant

​Fallback Chain

​Tool Permissions

​Allowed Tools

​Blocked Tools

​Core Capabilities

​1. Todo-Driven Workflow

​2. Intent Gate & Classification

​3. Strategic Delegation

​4. Parallel Execution

​Execution Phases

​Phase 0: Intent Gate

​Phase 1: Codebase Assessment

​Phase 2A: Exploration & Research

​Phase 2B: Implementation

​Phase 2C: Failure Recovery

​Phase 3: Completion

​Delegation Prompt Structure

​Session Continuity

​Usage Examples

​Example 1: Multi-File Feature

​Example 2: Codebase Investigation

​Example 3: Consultation for Hard Problems

​Communication Style

​Be Concise

​No Flattery

​No Status Updates

​When User is Wrong

​Configuration

​Best Practices

​Related Agents

Build docs developers (and LLMs) love

Overview

Model Configuration

Default Model

GPT Variant

Ultrawork Variant

Fallback Chain

Tool Permissions

Allowed Tools

Blocked Tools

Core Capabilities

1. Todo-Driven Workflow

2. Intent Gate & Classification

3. Strategic Delegation

4. Parallel Execution

Execution Phases

Phase 0: Intent Gate

Phase 1: Codebase Assessment

Phase 2A: Exploration & Research

Phase 2B: Implementation

Phase 2C: Failure Recovery

Phase 3: Completion

Delegation Prompt Structure

Session Continuity

Usage Examples

Example 1: Multi-File Feature

Example 2: Codebase Investigation

Example 3: Consultation for Hard Problems

Communication Style

Be Concise

No Flattery

No Status Updates

When User is Wrong

Configuration

Best Practices

Related Agents