Common Issues

Workers not starting or failing immediately

Symptoms: Tasks stuck in “Pending” or immediately fail after spawnCommon causes:

Missing Claude Code installation
- Workers are spawned as ACP (Agent Client Protocol) subprocesses
- Verify Claude Code is installed and accessible
- Check DEBUG logs for subprocess spawn errors
Node.js not available
- Some ACP agents require Node.js runtime
- Verify: node --version
Infrastructure broken flag set
- If a cp command fails during worker spawn, the infra_broken flag is set
- All subsequent spawns auto-fail to avoid repeated errors
- Check logs for “copy failed” or “infra_broken” messages
- Solution: Fix the underlying issue (disk space, permissions) and restart Enki

Check the logs:

# Look for spawn failures
grep "spawn" ~/.enki/logs/enki.log | tail -n 50

# Check for copy errors
grep "copy\|infra_broken" ~/.enki/logs/enki.log

Merge conflicts during task completion

Symptoms: Worker completes successfully but merge fails, task enters Failed stateWhat happens:

When a worker completes, Enki fetches its task/<id> branch from the worker’s copy
The refinery attempts to merge it into your main branch
If there’s a conflict, Enki spawns a merger agent with minimal tools to resolve it

Merger agent flow:

Conflict detected → MergeNeedsResolution event
Separate ACP session spawned with MERGER_TOOLS (minimal tool access)
Works in a shared temp clone (kept alive via CleanupGuard + std::mem::forget)
Agent resolves conflict and commits
Merge completes → MergeDone advances the DAG

If merger fails:

Check .enki/verify.sh hook (if present) — non-zero exit fails the merge
Review merge request logs in the database
Check session logs: ~/.enki/logs/sessions/<merger-session>.log

Manual resolution:

# Check pending merge requests
sqlite3 .enki/db.sqlite "SELECT * FROM merge_requests WHERE status='pending';"

# Inspect the branch
git log task/<id>
git diff main...task/<id>

Copy-on-Write failures

Symptoms: “copy failed” errors, workers not getting isolated copiesAbout CoW copies:

Each worker gets a full filesystem copy at .enki/copies/<task_id>/
Uses platform-specific CoW for instant, space-efficient clones:
- macOS/APFS: cp -Rc
- Linux: cp --reflink=auto -a
Excludes .enki/ from copies

Common causes:

Insufficient disk space
- Even with CoW, metadata and unique files consume space
- Check: df -h
Filesystem doesn’t support CoW
- Linux: Requires btrfs or XFS (with recent kernel)
- macOS: Requires APFS
- On unsupported filesystems, falls back to full copy (slow + space-heavy)
Permission issues
- Check .enki/copies/ directory permissions
- Ensure user can write to project directory

Fix:

# Check filesystem type
df -T .  # Linux
diskutil info / | grep "File System"  # macOS

# Clean up old copies if space is low
rm -rf .enki/copies/task-*

# Restart Enki to clear infra_broken flag

Stale workers / timeout issues

Symptoms: Workers appear stuck, no progress for extended periodHow monitoring works:

The Monitor state machine tracks worker activity via ACP updates
Workers with no update for STALE_CANCEL_SECS get cancelled:
- Standard tier: 120 seconds default
Retry budget: up to 3 retries per task (MAX_TASK_RETRIES)
After max retries, task blocks to prevent infinite loops

Check worker activity:

# Recent worker activity in logs
tail -n 200 ~/.enki/logs/enki.log | grep "worker\|session"

# Session-specific activity (JSON-RPC traffic)
cat ~/.enki/logs/sessions/<label>.log

If workers are genuinely stuck:

Use MCP enki_stop_all tool to cancel all running workers
Check for deadlocks in session logs
Review task complexity tier assignment (heavy tasks get more time)

Manual intervention:

# Stop all workers via database
sqlite3 .enki/db.sqlite "UPDATE tasks SET status='Cancelled' WHERE status='Running';"

# Restart Enki

Database corruption or state issues

Symptoms: Crashes on startup, inconsistent task states, migration errorsDatabase details:

SQLite in WAL mode: .enki/db.sqlite
DAG stored as JSON blob in executions table
Auto-migration on every DB open (no version files)

Recovery steps:

Backup first:

cp .enki/db.sqlite .enki/db.sqlite.backup

Check integrity:

sqlite3 .enki/db.sqlite "PRAGMA integrity_check;"

If corrupted, re-initialize (destroys task history):

rm .enki/db.sqlite
rm .enki/db.sqlite-shm
rm .enki/db.sqlite-wal
# Restart Enki — will auto-create fresh DB

Abandoned tasks (DB-only state):
- Set on session exit for in-flight tasks
- Never enters the DAG runtime
- Safe to ignore or manually clean up:
  sqlite3 .enki/db.sqlite "DELETE FROM tasks WHERE status='Abandoned';"

Permission denied errors

Symptoms: “Permission denied” when accessing files, creating copies, or running workersCommon causes:

Project directory permissions
- Enki needs write access to .enki/ subdirectory
- Check: ls -la .enki/
Worker copy permissions
- CoW copies preserve original file permissions
- If source files are read-only for the user, workers can’t modify them
Git repository permissions
- Workers create branches and commits
- Check: git config user.name && git config user.email

Fix:

# Ensure .enki/ is writable
chmod -R u+w .enki/

# Check git config
git config --list | grep user

# Fix git identity if needed
git config user.name "Your Name"
git config user.email "[email protected]"

Signal file processing issues

Symptoms: MCP tool calls succeed but nothing happens, tasks don’t startHow signals work:

MCP server writes .enki/events/sig-*.json files
Coordinator polls on 3-second tick and processes them
Files are deleted after processing
No filesystem notifications (no fsnotify)

Debug:

# Check for stuck signal files
ls -la .enki/events/

# Watch signal processing in real-time
tail -f ~/.enki/logs/enki.log | grep "signal\|CheckSignals"

# Manually inspect a signal file
cat .enki/events/sig-*.json

Common issues:

Coordinator not running (TUI exited)
Permission issues on .enki/events/ directory
Malformed JSON in signal file (MCP server bug)

Fix:

# Clean up stuck signals
rm .enki/events/sig-*.json

# Restart Enki TUI

Contributing

Troubleshooting

Common Issues

Quick Diagnostics

Common Problems

Still Having Issues?

Useful Diagnostic Commands

Build docs developers (and LLMs) love

Contributing

Troubleshooting

​Quick Diagnostics

​Common Problems

​Still Having Issues?

​Useful Diagnostic Commands

Build docs developers (and LLMs) love

Quick Diagnostics

Common Problems

Still Having Issues?

Useful Diagnostic Commands