.agents/skills/ directory. These serve as examples and can be used directly in your projects.
find-warden-bugs
Purpose: Detect bugs at Warden’s architectural seams based on 40+ historical fix commits.
Location: .agents/skills/find-warden-bugs/SKILL.md
What It Detects
This skill targets recurring bug patterns in Warden’s architecture:
Check 1: SDK Response Shape Assumptions
Severity: High
Claude SDK responses have specific shapes that have caused repeated issues. The skill detects:
- Accessing response.content[0] without checking array length or block type
- Accessing msg.usage.input_tokens without null check on usage
- Type predicates that silently filter unknown content types
- Accessing cache_read_input_tokens without handling null
- Parsing SDKResultMessage without checking is_error or subtype
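As an illustration of the guarded access these checks look for, here is a minimal sketch. The Message, Usage, and ContentBlock types are simplified, hypothetical stand-ins rather than the real SDK types, and the helper names firstText and totalInputTokens are invented for this example.

```typescript
// Hypothetical, simplified shapes; the real SDK types carry more fields.
type ContentBlock =
  | { type: "text"; text: string }
  | { type: "tool_use"; id: string; name: string };

interface Usage {
  input_tokens: number;
  cache_read_input_tokens?: number | null;
}

interface Message {
  content: ContentBlock[];
  usage?: Usage | null;
}

// Returns the first block's text, or undefined when the array is empty
// or the first block is not a text block.
function firstText(msg: Message): string | undefined {
  const block = msg.content[0];
  if (block === undefined || block.type !== "text") return undefined;
  return block.text;
}

// Sums input tokens, treating missing usage and null cache counts as 0.
function totalInputTokens(msg: Message): number {
  const usage = msg.usage;
  if (usage == null) return 0;
  return usage.input_tokens + (usage.cache_read_input_tokens ?? 0);
}
```

Both helpers return a value the caller can test for, instead of throwing on an unexpected response shape.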
Check 2: Dual Code Path Desync
Severity: High
Warden has two paths that build SkillReport objects:
- runSkill() in src/sdk/analyze.ts (SDK/action)
- runSkillTask() in src/cli/output/tasks.ts (CLI)
The skill detects:
- Adding fields to SkillReport in only one path
- Different post-processing logic
- Inconsistent error handling
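One common way to keep two such paths aligned is to route both through a single builder. The sketch below assumes a reduced, hypothetical SkillReport shape; RawRun, buildSkillReport, reportForSdk, and reportForCli are invented names and do not describe how Warden is actually organized.

```typescript
// Hypothetical, reduced report shape; the real SkillReport has more fields.
interface SkillReport {
  skill: string;
  findings: string[];
  durationMs: number;
}

// Hypothetical raw result produced by either execution path.
interface RawRun {
  skill: string;
  findings: string[];
  startedAt: number;
  finishedAt: number;
}

// Single place where fields are populated and post-processing happens.
function buildSkillReport(raw: RawRun): SkillReport {
  return {
    skill: raw.skill,
    findings: Array.from(new Set(raw.findings)), // de-duplicate once, for both paths
    durationMs: Math.max(0, raw.finishedAt - raw.startedAt),
  };
}

// Both entry points delegate, so a new field cannot land in only one of them.
function reportForSdk(raw: RawRun): SkillReport {
  return buildSkillReport(raw);
}

function reportForCli(raw: RawRun): SkillReport {
  return buildSkillReport(raw);
}
```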
Check 3: Config Threading & Default Semantics
Severity: High
Config flows through a 3-level merge chain. The skill detects:
- Breaking merge precedence (trigger > skill > defaults)
- Using || when ?? is needed (0/false/"" are valid)
- New config fields not threaded through resolveSkillConfigs()
- emptyToUndefined() not applied to GitHub Actions inputs
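The precedence and default semantics above can be sketched as follows. The field names (maxFindings, failOnError, model) and the helpers resolveSketch and emptyToUndefinedSketch are hypothetical; they illustrate the pattern and are not Warden's actual resolveSkillConfigs() or emptyToUndefined() implementations.

```typescript
// Hypothetical config fields, chosen so that 0, false, and "" are all meaningful.
interface SkillConfig {
  maxFindings?: number;
  failOnError?: boolean;
  model?: string;
}

// trigger > skill > defaults. `??` falls through only on null/undefined,
// so an explicit 0, false, or "" at a higher level is preserved.
function resolveSketch(
  defaults: Required<SkillConfig>,
  skill: SkillConfig,
  trigger: SkillConfig,
): Required<SkillConfig> {
  return {
    maxFindings: trigger.maxFindings ?? skill.maxFindings ?? defaults.maxFindings,
    failOnError: trigger.failOnError ?? skill.failOnError ?? defaults.failOnError,
    model: trigger.model ?? skill.model ?? defaults.model,
  };
}

// Stand-in for normalizing an action input: an unset input arrives as "",
// which should behave as "not provided" before the merge runs.
function emptyToUndefinedSketch(input: string): string | undefined {
  return input === "" ? undefined : input;
}
```

With `||` in place of `??`, a trigger-level maxFindings of 0 or failOnError of false would silently fall back to the skill or default value.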
Check 4: Concurrent Task & Ink Rendering
Severity: High
Skills run concurrently via runPool() while Ink renders a live UI. Detects:
- Mutating shared state from callbacks without synchronization
- Sort comparators accessing external mutable state
- Writing to process.stderr while Ink is rendering
- Not checking shouldAbort() after semaphore acquisition
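The last item is easy to miss: a task may wait a long time for a slot, and the run may have been cancelled in the meantime. Below is a minimal sketch of re-checking after acquisition. The Semaphore class and runTask helper are hypothetical and written for this example; they are not Warden's runPool() internals.

```typescript
// Minimal counting semaphore, written for this example only.
class Semaphore {
  private permits: number;
  private waiters: Array<() => void> = [];

  constructor(permits: number) {
    this.permits = permits;
  }

  async acquire(): Promise<void> {
    if (this.permits > 0) {
      this.permits--;
      return;
    }
    // No permit available: wait until release() hands one over.
    await new Promise<void>((resolve) => this.waiters.push(resolve));
  }

  release(): void {
    const next = this.waiters.shift();
    if (next) next(); // pass the permit directly to the next waiter
    else this.permits++;
  }
}

// Runs `task` under the semaphore, skipping it if the run was aborted
// while this task was waiting for a permit.
async function runTask<T>(
  sem: Semaphore,
  shouldAbort: () => boolean,
  task: () => Promise<T>,
): Promise<T | undefined> {
  await sem.acquire();
  try {
    if (shouldAbort()) return undefined; // state may have changed during the wait
    return await task();
  } finally {
    sem.release(); // always return the permit, including on abort or error
  }
}
```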
Check 5: Output Rendering Consistency
Severity: Medium
Warden renders in multiple formats (terminal, JSON, JSONL, GitHub checks). Detects:
- Display filtering applied before JSON serialization
- --json flag short-circuiting before all findings collected
- Reading log files that weren’t verified to exist
- GitHub annotations built from filtered findings
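The first item comes down to ordering: serialize the complete set, then filter for display. A minimal sketch, assuming a hypothetical Finding shape and a hypothetical renderSketch helper:

```typescript
type Severity = "high" | "medium" | "low";

// Hypothetical finding shape for illustration.
interface Finding {
  id: string;
  severity: Severity;
}

const RANK: Record<Severity, number> = { high: 3, medium: 2, low: 1 };

// Machine-readable output is built from the full list; only the terminal
// view applies the display threshold.
function renderSketch(
  findings: Finding[],
  minDisplay: Severity,
): { json: string; terminal: string[] } {
  const json = JSON.stringify(findings);
  const terminal = findings
    .filter((f) => RANK[f.severity] >= RANK[minDisplay])
    .map((f) => `${f.severity}: ${f.id}`);
  return { json, terminal };
}
```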
Other Checks
The skill includes additional checks for:
- Check 6: Scope & filtering logic (hunk line validation)
- Check 7: Early-exit path completeness (cleanup, output writes)
- Check 8: State tracking accuracy (counting operations correctly)
- Check 9: Error context & control flow (preserving error types)
Usage
warden.toml
Lessons for Your Skills
- Check-based structure: Each check targets a specific historical pattern
- Zone classification: Only run relevant checks based on file paths
- Safe patterns section: Reduces false positives
- Historical context: “Historical commits: 8+” shows this is a real problem
- Severity tied to impact: High = normal usage breaks, Medium = edge cases
architecture-review
Purpose: Staff-level codebase health review. Finds structural issues that compound over time.
Location: .agents/skills/architecture-review/SKILL.md
What It Analyzes
1. Module Complexity
Finds files that have grown too large or do too much:
- Files >500 lines (investigate >800)
- Modules with >3 distinct responsibilities
- High fan-out (importing from 10+ modules)
2. Silent Failure Patterns
Code that fails without indication:
- Catch blocks returning defaults without logging
- Functions returning [] or null where caller can’t distinguish error from empty
- Missing error callbacks on async operations
- Silent fallbacks hiding upstream problems
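One way to let callers tell "failed" apart from "legitimately empty" is to return a tagged result instead of a bare default. The Result type and parseIds function below are a hypothetical illustration of that pattern, not code taken from the skill.

```typescript
// Tagged result: the caller must look at `ok` before using the value.
type Result<T> = { ok: true; value: T } | { ok: false; error: Error };

// Parses a JSON array of numbers. An empty array is a success;
// malformed input is an explicit failure rather than a silent [].
function parseIds(raw: string): Result<number[]> {
  try {
    const data: unknown = JSON.parse(raw);
    if (!Array.isArray(data) || !data.every((x) => typeof x === "number")) {
      return { ok: false, error: new Error("expected a JSON array of numbers") };
    }
    return { ok: true, value: data };
  } catch (err) {
    return { ok: false, error: err instanceof Error ? err : new Error(String(err)) };
  }
}
```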
3. Type Safety Gaps
Places where TypeScript safety is bypassed:
- as SomeType without runtime validation
- Regex match assertions without checking capture groups
- Optional chaining (?.) hiding null sources
- Generic index access: obj[key]
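For the regex case, the safer pattern is to check both the match and each capture group before use, rather than asserting them non-null. A minimal sketch, using a hypothetical parseHunkHeader helper that reads the new-file range from a unified diff hunk header:

```typescript
interface HunkRange {
  start: number;
  count: number;
}

// Parses the "+start,count" part of a header such as "@@ -1,3 +10,5 @@".
// Returns undefined instead of asserting that the match or groups exist.
function parseHunkHeader(line: string): HunkRange | undefined {
  const match = /^@@ -\d+(?:,\d+)? \+(\d+)(?:,(\d+))? @@/.exec(line);
  if (match === null) return undefined;
  const start = match[1];
  if (start === undefined) return undefined;
  const count = match[2]; // optional group: absent means a single line
  return {
    start: Number(start),
    count: count === undefined ? 1 : Number(count),
  };
}
```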
4. Test Coverage Analysis
Maps what is tested against what is critical:
- Untested critical paths (core logic, orchestration, error handling)
- Edge case gaps (empty inputs, null values, boundaries)
- Integration gaps (cross-module flows with only unit tests)
- Regression coverage (bug fixes without tests)
5. LLM-Friendliness
How well the code supports AI-assisted development:
- JSDoc coverage on exports
- Naming clarity (understandable without reading implementation)
- Actionable error messages
- Configuration footguns
Usage
warden.toml
Output Format
Generates a structured report.
Lessons for Your Skills
- Macro over micro: Focus on structural issues, not style preferences
- Pre-report checklist: Validates work before reporting
- Risk prioritization: Hot paths > edge cases > utilities
- Positive feedback: “What’s Working Well” preserves good patterns
testing-guidelines
Purpose: Guide for writing tests. Used when adding functionality or fixing bugs.
Location: .agents/skills/testing-guidelines/SKILL.md
Core Principles
Mock External Services, Use Real Fixtures
Always mock third-party network services. Always use fixtures based on real-world data (sanitized).
Prefer Integration Tests Over Unit Tests
Focus on end-to-end tests validating inputs and outputs, not implementation details.
Always Add Regression Tests for Bugs
When a bug is found, add a test that would have caught it. The test should fail before the fix and pass after.
Usage
Reference this skill when writing tests.
Lessons for Your Skills
- Principle-based: Clear numbered principles, not exhaustive checklists
- Concrete examples: Shows fixture format, test structure
- Checklist: Pre-submission validation
agent-prompt
Purpose: Reference guide for writing effective agent prompts and skills.
Location: .agents/skills/agent-prompt/SKILL.md
Structure
This skill acts as a router to detailed reference files.
Usage
Lessons for Your Skills
- Reference architecture: Main skill routes to specialized docs
- Table-based routing: “Read X when doing Y” guides context selection
- Bundled resources: References live alongside the skill
Using Builtin Skills
Reference in Your Config
warden.toml
Study Their Patterns
The builtin skills demonstrate:
- Specificity: Each targets concrete, provable patterns
- Structure: Check-based or principle-based organization
- Examples: Red flags, safe patterns, not-a-bug sections
- Calibration: Confidence thresholds and severity guidance
- Historical grounding: Reference past bugs to sharpen detection
Copy and Adapt
Use builtin skills as templates.
Next Steps
Creating Skills
Write your own skill based on these examples
Remote Skills
Use skills from other repositories