Quick Wins
Add to ~/.claude/settings.json:
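If the file already holds other settings, merging is safer than overwriting. The following is a minimal sketch, not an official tool. It uses only the two names that appear in this guide (`model` and `MAX_THINKING_TOKENS`) with the values from the checklist at the end. Placing `MAX_THINKING_TOKENS` under an `env` map is an assumption about the settings layout, and `merge_settings` and `apply_to_file` are hypothetical helper names.

```python
import json
from pathlib import Path

# Values come from this guide's checklist. Nesting the token limit under
# "env" is an assumption about the settings file layout.
RECOMMENDED = {
    "model": "sonnet",
    "env": {"MAX_THINKING_TOKENS": "10000"},
}


def merge_settings(existing: dict, updates: dict) -> dict:
    """Return a copy of `existing` with `updates` merged in, recursing into nested dicts."""
    merged = dict(existing)
    for key, value in updates.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_settings(merged[key], value)
        else:
            merged[key] = value
    return merged


def apply_to_file(path: Path, updates: dict) -> dict:
    """Merge `updates` into the JSON file at `path`, creating the file if it is missing."""
    current = json.loads(path.read_text()) if path.exists() else {}
    merged = merge_settings(current, updates)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(merged, indent=2) + "\n")
    return merged
```

Calling `apply_to_file(Path.home() / ".claude" / "settings.json", RECOMMENDED)` would update the user-level file while leaving unrelated keys in place. Back up the file first.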
Model Selection
Sonnet for Daily Development
Use Sonnet as your default model. It handles 80%+ of coding tasks and costs ~60% less than Opus.
| Setting | Default | Recommended | Impact |
|---|---|---|---|
| model | opus | sonnet | ~60% cost reduction |
| Task Coverage | - | 80%+ | Most coding tasks |
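To see what the model mix alone contributes, the blended cost can be estimated from the two figures above. This is a rough sketch under stated assumptions: Opus cost is normalized to 1.0, Sonnet is taken as 0.40 of that (the "~60% less" figure), and both models are assumed to process similar token volumes per task. `blended_cost` and `savings_pct` are hypothetical helper names.

```python
OPUS_RELATIVE_COST = 1.0     # baseline
SONNET_RELATIVE_COST = 0.40  # "~60% less than Opus", per the text above


def blended_cost(opus_share: float) -> float:
    """Cost relative to an all-Opus workflow when `opus_share` of work runs on Opus."""
    if not 0.0 <= opus_share <= 1.0:
        raise ValueError("opus_share must be between 0 and 1")
    sonnet_share = 1.0 - opus_share
    return opus_share * OPUS_RELATIVE_COST + sonnet_share * SONNET_RELATIVE_COST


def savings_pct(opus_share: float) -> float:
    """Percentage saved versus running everything on Opus."""
    return round((1.0 - blended_cost(opus_share)) * 100, 1)


for share in (1.0, 0.2, 0.0):
    print(f"Opus share {share:.0%}: about {savings_pct(share)}% saved")
```

Under these assumptions, keeping Opus to 20% of the work saves roughly 48% from model selection alone. The full ~60% applies only when every task runs on Sonnet.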
When to Switch to Opus
- Complex architectural decisions
- Deep debugging sessions
- Multi-system refactoring
- First-principles problem solving
Thinking Token Limits
Claude’s “thinking” happens behind the scenes and consumes tokens you don’t see.
Reduce Thinking Tokens
A reduced thinking budget is usually enough for:
- Code review
- Bug fixes
- Feature implementation
- Refactoring
A larger budget may be worth keeping for:
- Large-scale architecture decisions
- Complex debugging across many files
Auto-Compaction Strategy
Context windows fill up during long sessions. Claude auto-compacts at 95% by default, but this is too late.
Compact Earlier
- Better quality in long sessions
- Prevents context degradation
- More aggressive cleanup of irrelevant context
Compaction at 95% means you’ve already filled 190k of your 200k window. Compacting at 50% gives Claude more room to work.
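The arithmetic behind those figures is easy to check. The sketch below assumes the 200k window used throughout this guide; `tokens_at_threshold` and `headroom` are hypothetical helper names.

```python
CONTEXT_WINDOW = 200_000  # window size used throughout this guide


def tokens_at_threshold(threshold_pct: int, window: int = CONTEXT_WINDOW) -> int:
    """Tokens already in context at the moment auto-compaction triggers."""
    return window * threshold_pct // 100


def headroom(threshold_pct: int, window: int = CONTEXT_WINDOW) -> int:
    """Tokens still free in the window when auto-compaction triggers."""
    return window - tokens_at_threshold(threshold_pct, window)


for pct in (95, 50):
    used = tokens_at_threshold(pct)
    free = headroom(pct)
    print(f"{pct}% threshold: compacts at {used:,} tokens, {free:,} still free")
```

At 95% only 10,000 tokens remain free when compaction starts. At 50% there are 100,000.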
Manual Compaction
Use /compact at logical breakpoints instead of relying on auto-compaction.
When to Compact
After Research Phase
You’ve explored the codebase, found what you need. Compact before implementing.
When NOT to Compact
Context Window Management
Each MCP tool description consumes tokens from your 200k window.
Critical: Too many MCPs can reduce your effective window from 200k to ~70k.
MCP Best Practices
- Keep under 10 MCPs enabled per project
- Keep under 80 tools active total
- Disable unused MCPs in project config
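These limits can be checked mechanically once you know how many tools each enabled MCP exposes. The sketch below is illustrative only. The server names and tool counts are made-up sample data, and `check_mcp_budget` is a hypothetical helper, not part of Claude Code.

```python
MAX_MCPS = 10   # "Keep under 10 MCPs enabled per project"
MAX_TOOLS = 80  # "Keep under 80 tools active total"


def check_mcp_budget(tools_per_mcp: dict[str, int]) -> list[str]:
    """Return a warning for each limit from this guide that the setup reaches or exceeds."""
    warnings = []
    mcp_count = len(tools_per_mcp)
    tool_count = sum(tools_per_mcp.values())
    if mcp_count >= MAX_MCPS:
        warnings.append(f"{mcp_count} MCPs enabled; keep under {MAX_MCPS}")
    if tool_count >= MAX_TOOLS:
        warnings.append(f"{tool_count} tools active; keep under {MAX_TOOLS}")
    return warnings


# Hypothetical sample data: server name -> number of tools it exposes.
sample = {"github": 30, "filesystem": 12, "browser": 45}
for warning in check_mcp_budget(sample):
    print("WARNING:", warning)
```

The sample has only three MCPs but 87 tools, so it trips the tool limit. A small number of servers can still crowd the context window.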
Check Active Tools
Subagent Model Selection
Subagents handle delegated tasks. Use Haiku for routine work.
- Code review (code-reviewer agent)
- Build error resolution (build-error-resolver agent)
- Documentation updates (doc-updater agent)
- Test generation (tdd-guide agent for simple cases)
Reserve a more capable model for:
- Complex architecture (architect agent)
- Security audits (security-reviewer agent)
- Multi-file refactoring
Daily Workflow Commands
| Command | When to Use | Cost Impact |
|---|---|---|
| /model sonnet | Default for most tasks | 60% cheaper than Opus |
| /model opus | Complex architecture, deep debugging | Full cost, use sparingly |
| /clear | Between unrelated tasks | Free instant reset |
| /compact | Logical task breakpoints | Reduces context, improves quality |
| /cost | Monitor spending | Visibility into token usage |
Example Workflow
Agent Teams Warning
Only use Agent Teams when:
- Parallelism provides clear value (multi-module work)
- Parallel reviews (security + code quality)
For sequential work, a single-context subagent is enough:
- /plan → planner agent (single context)
- /code-review → code-reviewer agent (single context)
Cost Monitoring
Check Current Usage
Track Over Time
Monitor your Claude Code dashboard:
- Daily usage trends
- Per-project costs
- Model distribution (Opus vs Sonnet)
Strategic Compaction Skill
ECC includes a strategic-compact skill that suggests /compact at logical breakpoints.
See skills/strategic-compact/SKILL.md for the full decision guide.
Compaction Decision Tree
Summary: Optimal Settings
Cost Reduction Checklist
- Default model set to sonnet
- MAX_THINKING_TOKENS reduced to 10000
- Auto-compact threshold at 50%
- Subagent model set to haiku
- Unused MCPs disabled per project
- Total MCPs under 10
- Total tools under 80
- Using /clear between unrelated tasks
- Using /compact at logical breakpoints
- Using /cost to monitor spending
- Opus usage <20% of total
Expected Savings: 60-70% cost reduction with these optimizations applied.