Agent System

Forge uses a flexible agent system where each agent is a specialized AI entity with specific capabilities, tools, and instructions. Agents can work independently or collaborate to solve complex problems.

What is an Agent?

An agent in Forge is a configured AI instance with:

Unique identity - Each agent has an ID and optional title
Model configuration - Which LLM provider and model to use
System prompt - Instructions that define the agent’s behavior and expertise
Tool access - Which tools the agent can use
Custom rules - Guidelines specific to this agent’s domain

Agent Anatomy

Here’s what defines an agent in Forge:

id: code-reviewer
title: Code Review Specialist
description: Reviews code for quality, security, and best practices
model: claude-3.7-sonnet
tools:
  - read
  - grep
  - glob
system_prompt: |
  You are a senior code reviewer focused on:
  - Security vulnerabilities
  - Performance issues
  - Code maintainability
  - Best practices compliance
custom_rules: |
  Always check for:
  1. SQL injection vulnerabilities
  2. Proper error handling
  3. Input validation
temperature: 0.3
max_tokens: 4000

Built-in Agents

Forge comes with three built-in agents:

Forge Agent

The default general-purpose agent that handles most coding tasks.

AgentId::FORGE

Best for: General development tasks, debugging, refactoring, and feature implementation.

Muse Agent

A creative agent optimized for exploring ideas and generating solutions.

AgentId::MUSE

Best for: Brainstorming, architecture design, and creative problem-solving.

Sage Agent

A knowledge-focused agent that excels at explaining concepts and answering questions.

AgentId::SAGE

Best for: Code understanding, documentation, and learning new technologies.

Agent Configuration

Model Settings

Control how the agent generates responses:

temperature: 0.7  # 0.0 (focused) to 2.0 (creative)
top_p: 0.9        # Nucleus sampling
top_k: 50         # Token selection diversity
max_tokens: 4000  # Maximum response length

Lower temperature (0.0-0.3) produces more deterministic, focused responses. Higher temperature (0.7-2.0) generates more creative, diverse outputs.

Tool Configuration

Specify which tools an agent can use:

tools:
  - read        # Read files
  - write       # Write files
  - edit        # Edit existing files
  - bash        # Execute shell commands
  - grep        # Search file contents
  - glob        # Find files by pattern

Only grant agents the tools they need. Limiting tool access improves security and reduces the chance of unintended actions.

Turn Limits

Prevent infinite loops by setting turn limits:

max_turns: 50
max_tool_failure_per_turn: 3
max_requests_per_turn: 50

Custom Agents

You can create custom agents for specialized tasks. For example, a database migration agent:

id: db-migrator
title: Database Migration Specialist
description: Creates and manages database migrations
model: claude-3.7-sonnet
tools:
  - read
  - write
  - bash
system_prompt: |
  You are a database migration expert. When creating migrations:
  1. Always include rollback procedures
  2. Use transactions where possible
  3. Add descriptive comments
  4. Test migrations with sample data
  5. Follow the existing migration naming convention
custom_rules: |
  - Never drop tables without explicit confirmation
  - Always backup data before destructive operations
  - Use parameterized queries to prevent SQL injection
temperature: 0.2
max_tokens: 3000

Agent Templates

Agents can use templates for dynamic prompts:

id: test-generator
system_prompt: |
  You are a test writing expert for {{language}} using {{test_framework}}.
  Generate comprehensive tests that cover:
  - Happy path scenarios
  - Edge cases
  - Error conditions
user_prompt: |
  Generate tests for: {{code_snippet}}

Reasoning Configuration

Some models support extended reasoning capabilities:

reasoning:
  enabled: true
  effort: medium      # low, medium, or high
  max_tokens: 2048    # Thinking budget
  exclude: false      # Show reasoning process

Reasoning configuration is supported by providers like OpenRouter, Anthropic, and Forge Services. It enables deeper analysis at the cost of additional tokens.

Context Compaction

Automatically manage conversation length:

compact:
  token_threshold: 100000      # Compact when exceeding this
  message_threshold: 100       # Or this many messages
  turn_threshold: 50          # Or this many turns
  retention_window: 10        # Keep last N messages
  eviction_window: 0.5        # Compact up to 50% of messages
  model: claude-3-haiku       # Use cheaper model for compaction

Compaction summarizes old conversation parts to keep context manageable while preserving important information.

Agent Lifecycle

When an agent is invoked:

Initialization - Agent loads with its configuration
Context Building - System prompt and user prompt are rendered with context
Execution - Agent processes the request using available tools
Tool Calls - Agent can call tools multiple times in a single turn
Response - Agent provides the final response
Compaction (if needed) - Conversation is compacted based on settings

Provider and Model Selection

Agents can specify their provider and model:

provider: anthropic
model: claude-3.7-sonnet

If not specified, agents inherit from the workflow configuration or use the default provider.

Best Practices

Specialize Your Agents

Create agents with focused expertise:

✅ security-auditor - Security-focused code review
✅ docs-writer - Technical documentation generation
❌ do-everything - Jack of all trades, master of none

Write Clear System Prompts

Your system prompt is the agent’s “programming”. Be explicit:

system_prompt: |
  You are a frontend performance expert.
  
  When analyzing code, always check:
  1. Unnecessary re-renders in React components
  2. Unoptimized images
  3. Large bundle sizes
  4. Blocking network requests
  
  Provide specific, actionable recommendations with code examples.

Limit Tool Access

Only grant necessary tools:

# Good: Documentation agent only needs read access
tools:
  - read
  - grep
  - glob

# Bad: Giving write access unnecessarily
tools:
  - read
  - write  # Not needed for documentation!
  - bash

Use Temperature Wisely

0.0-0.3: Code generation, refactoring, precise tasks
0.4-0.7: Balanced tasks, problem-solving
0.8-2.0: Creative tasks, brainstorming, exploration

FAQ

Getting Started

Core Concepts

Configuration

Providers

Features

Advanced Usage

Guides

Agent System

Agent System

What is an Agent?

Agent Anatomy

Built-in Agents

Forge Agent

Muse Agent

Sage Agent

Agent Configuration

Model Settings

Tool Configuration

Turn Limits

Custom Agents

Agent Templates

Reasoning Configuration

Context Compaction

Agent Lifecycle

Provider and Model Selection

Best Practices

Specialize Your Agents

Write Clear System Prompts

Limit Tool Access

Use Temperature Wisely

Build docs developers (and LLMs) love

Getting Started

Core Concepts

Configuration

Providers

Features

Advanced Usage

Guides

​Agent System

​What is an Agent?

​Agent Anatomy

​Built-in Agents

​Forge Agent

​Muse Agent

​Sage Agent

​Agent Configuration

​Model Settings

​Tool Configuration

​Turn Limits

​Custom Agents

​Agent Templates

​Reasoning Configuration

​Context Compaction

​Agent Lifecycle

​Provider and Model Selection

​Best Practices

​Specialize Your Agents

​Write Clear System Prompts

​Limit Tool Access

​Use Temperature Wisely

Build docs developers (and LLMs) love

Agent System

What is an Agent?

Agent Anatomy

Built-in Agents

Forge Agent

Muse Agent

Sage Agent

Agent Configuration

Model Settings

Tool Configuration

Turn Limits

Custom Agents

Agent Templates

Reasoning Configuration

Context Compaction

Agent Lifecycle

Provider and Model Selection

Best Practices

Specialize Your Agents

Write Clear System Prompts

Limit Tool Access

Use Temperature Wisely