Skills

What are Skills?

Skills are Claude-powered code analysis agents that review your codebase. Each skill is a self-contained expert that knows what to look for and how to report it. Skills are defined using the agentskills.io specification and can be shared across projects. Think of skills as specialized reviewers: one might check for security issues, another for performance problems, and another for code style consistency.

How Skills Work

Discovery

Warden discovers skills from conventional directories in your repository:

.agents/skills/ (recommended)
.claude/skills/ (Claude Code convention)
.warden/skills/ (legacy)

Skills found in earlier directories take precedence when names conflict.

Resolution

Skills can be referenced by:

Name: find-warden-bugs (searches conventional directories)
Path: ./custom/my-skill.md or custom/my-skill/SKILL.md
Remote: owner/repo or owner/repo@sha (fetched from GitHub)

Execution

When a trigger matches, Warden:

Loads the skill definition (name, description, prompt, tools)
Prepares relevant files from the diff or repository
Analyzes each file hunk with the Claude SDK
Extracts findings from the model’s JSON output
Validates findings fall within analyzed line ranges
Deduplicates and merges related findings

Reporting

Findings are reported through:

CLI: Terminal output with severity badges and locations
GitHub Action: Inline PR comments with annotations
JSON/JSONL: Machine-readable output for CI integration
Log files: Detailed execution logs with timing and usage stats

Skill Structure

Skills follow the agentskills.io format with YAML frontmatter and Markdown body:

Directory Format (Recommended)

.agents/skills/
└── find-warden-bugs/
    ├── SKILL.md          # Skill definition (required)
    ├── scripts/          # Helper scripts (optional)
    ├── references/       # Reference docs (optional)
    └── assets/          # Images, examples (optional)

Flat File Format

.agents/skills/
└── find-warden-bugs.md   # Skill definition

Skill Definition Format

---
name: find-warden-bugs
description: "Detects bugs at architectural seams using historical patterns"
allowed-tools: Read Grep Glob
---

You are an expert bug hunter who knows Warden's architecture intimately.

## Scope

You receive scoped code chunks from Warden's diff pipeline. Analyze each
chunk against the checks below.

## Checks

### Check 1: SDK Response Shape Assumptions

**Severity:** high | **Historical commits:** 5+

Claude SDK responses have a specific shape...

[More detailed prompt content]

The name field in frontmatter must match your skill configuration in warden.toml.

Skill Types

Analysis Skills

Most skills analyze code changes and report findings:

---
name: security-audit
description: "Detects common security vulnerabilities"
allowed-tools: Read Grep
---

Analyze the code for:
- SQL injection risks
- XSS vulnerabilities
- Hardcoded credentials
...

Verification Skills

Some skills verify fixes or validate behavior:

---
name: test-coverage
description: "Ensures new code has tests"
allowed-tools: Read Glob Bash
---

For each new function:
1. Search for corresponding test files
2. Verify test coverage exists
3. Report if coverage is missing
...

Tool Configuration

Skills can restrict which tools Claude can use:

---
name: readonly-analyzer
allowed-tools: Read Grep Glob
---

Available tools:

Read - Read file contents
Write - Create/overwrite files
Edit - Modify existing files
Bash - Execute shell commands
Glob - Find files by pattern
Grep - Search file contents
WebFetch - Fetch URLs (if enabled)
WebSearch - Search the web (if enabled)

By default, skills have access to all tools. Use allowed-tools to restrict access and prevent unintended modifications.

Resource Directories

Skills can bundle additional resources:

`scripts/`

Executable scripts that Claude can call:

# .agents/skills/my-skill/scripts/analyze.sh
#!/bin/bash
grep -rn "TODO" src/

Reference in prompt:

Run `./scripts/analyze.sh` to find TODO comments.

`references/`

Documentation and reference materials:

references/
├── api-spec.md
├── architecture.md
└── style-guide.md

Reference in prompt:

Consult `references/style-guide.md` for naming conventions.

`assets/`

Images, examples, and other static files:

assets/
├── example-good.ts
└── example-bad.ts

Resource paths are resolved relative to the skill’s root directory (rootDir in the loaded definition).

Remote Skills

Skills can be loaded from GitHub repositories:

[[skills]]
name = "code-simplifier"
remote = "getsentry/sentry-skills"

[[skills]]
name = "security-audit"
remote = "org/[email protected]"  # Pin to specific tag

Remote skills are cached in:

macOS/Linux: ~/.cache/warden/skills/
Windows: %LOCALAPPDATA%\warden\skills\

Cache TTL:

Tags/SHAs: 30 days (immutable references)
Branches: 1 hour (may change)

How remote skill resolution works

Parse reference: owner/repo[@ref] where ref defaults to HEAD
Check cache: If cached and not expired, use cached version
Fetch archive: Download tarball from GitHub
Extract and discover: Find SKILL.md files in skills/ directory
Cache and load: Store in cache with metadata

Skill Configuration in `warden.toml`

Skills are configured with triggers, path filters, and output options:

[[skills]]
name = "find-warden-bugs"
paths = ["src/**/*.ts"]           # Only analyze TypeScript
ignorePaths = ["src/**/*.test.ts"] # Skip test files
failOn = "high"                    # Fail on high severity
reportOn = "medium"                # Show medium+
maxFindings = 50                   # Limit output
model = "claude-sonnet-4"          # Override model
maxTurns = 30                      # Limit API calls
minConfidence = "medium"           # Filter low confidence

[[skills.triggers]]
type = "pull_request"
actions = ["opened", "synchronize"]

See Configuration for complete schema.

Type Definitions

From src/config/schema.ts:24-33:

export const SkillDefinitionSchema = z.object({
  name: z.string().min(1),
  description: z.string(),
  prompt: z.string(),
  tools: ToolConfigSchema.optional(),
  /** Directory where the skill was loaded from */
  rootDir: z.string().optional(),
});
export type SkillDefinition = z.infer<typeof SkillDefinitionSchema>;

Tool configuration from src/config/schema.ts:5-22:

export const ToolNameSchema = z.enum([
  'Read', 'Write', 'Edit', 'Bash',
  'Glob', 'Grep', 'WebFetch', 'WebSearch',
]);

export const ToolConfigSchema = z.object({
  allowed: z.array(ToolNameSchema).optional(),
  denied: z.array(ToolNameSchema).optional(),
});

Best Practices

Keep Skills Focused

Each skill should have a single responsibility. Better to have 5 focused skills than 1 that does everything.

Use Historical Patterns

Base checks on real bugs from your project history. The best skills know what actually breaks.

Calibrate Confidence

Use confidence levels to distinguish proven issues (high) from possible concerns (medium/low).

Provide Examples

Include both good and bad examples in your prompt. Show what to look for and what to ignore.

Test with Evals

Write eval specs (YAML) to verify your skill catches known issues. See evals/README.md.

Next Steps

Triggers

Learn how to control when skills run

Findings

Understand how findings are structured and reported

Configuration

Explore all configuration options

Skill Creator

Use the skill-creator agent to write new skills

Get Started

Core Concepts

CLI Usage

GitHub Action

Configuration

Advanced

Skills

What are Skills?

How Skills Work

Skill Structure

Directory Format (Recommended)

Flat File Format

Skill Definition Format

Skill Types

Analysis Skills

Verification Skills

Tool Configuration

Resource Directories

`scripts/`

`references/`

`assets/`

Remote Skills

Skill Configuration in `warden.toml`

Type Definitions

Best Practices

Keep Skills Focused

Use Historical Patterns

Calibrate Confidence

Provide Examples

Test with Evals

Next Steps

Triggers

Findings

Configuration

Skill Creator

Build docs developers (and LLMs) love

Get Started

Core Concepts

CLI Usage

GitHub Action

Configuration

Skills

Advanced

​What are Skills?

​How Skills Work

​Skill Structure

​Directory Format (Recommended)

​Flat File Format

​Skill Definition Format

​Skill Types

​Analysis Skills

​Verification Skills

​Tool Configuration

​Resource Directories

​scripts/

​references/

​assets/

​Remote Skills

​Skill Configuration in warden.toml

​Type Definitions

​Best Practices

Keep Skills Focused

Use Historical Patterns

Calibrate Confidence

Provide Examples

Test with Evals

​Next Steps

Triggers

Findings

Configuration

Skill Creator

Build docs developers (and LLMs) love

What are Skills?

How Skills Work

Skill Structure

Directory Format (Recommended)

Flat File Format

Skill Definition Format

Skill Types

Analysis Skills

Verification Skills

Tool Configuration

Resource Directories

`scripts/`

`references/`

`assets/`

Remote Skills

Skill Configuration in `warden.toml`

Type Definitions

Best Practices

Next Steps