Security Overview

AgentOS implements defense-in-depth security across all layers — from capability enforcement to cryptographic audit trails to sandboxed execution.

Security Architecture

AgentOS is built on fail-closed security principles: when a security service is unavailable, the system denies access rather than allowing potentially unsafe operations.

┌─────────────────────────────────────────────────────┐
│              Agent Request Layer                    │
├─────────────────────────────────────────────────────┤
│  RBAC Capability Check → Quota Enforcement          │
├─────────────────────────────────────────────────────┤
│  Prompt Injection Scan → Content Filtering          │
├─────────────────────────────────────────────────────┤
│  Rate Limiting → Loop Guard → Circuit Breaker       │
├─────────────────────────────────────────────────────┤
│  Tool Execution (Docker/WASM Sandbox)               │
├─────────────────────────────────────────────────────┤
│  Merkle Audit Chain → Ed25519 Signing               │
└─────────────────────────────────────────────────────┘

Security Features

RBAC

Per-agent capability enforcement with tool-level granularity

Audit Chain

Merkle-linked SHA-256 audit log with tamper detection

Sandboxing

Docker and WASM isolation with fuel limits

Vault

AES-256-GCM encrypted secrets with auto-lock

Fail-Closed Defaults

All security gates deny by default when services are unavailable:

const caps = await trigger("state::get", {
  scope: "capabilities",
  key: agentId,
}).catch(() => null);

if (!caps) {
  // Fail closed: deny if capabilities cannot be retrieved
  throw new Error(`Agent ${agentId} has no capabilities defined`);
}

Security services failing to respond results in request denial, preventing unauthorized access during outages.

Multi-Layer Protection

1. Authentication

Mutual Authentication Protocol (MAP): HMAC-SHA256 challenge-response between agents
Timing-Safe Comparison: Constant-time verification prevents timing attacks
Nonce Replay Protection: Single-use nonces with 60-second validity window

2. Authorization

RBAC: Tool-level capability enforcement per agent
Tool Policy: Allow/deny/approve policies per tool
Approval Tiers: Auto (read-only), Async (write), Sync (destructive)

3. Input Validation

Prompt Injection Scanning: 9 regex patterns detect common attacks
SQL Injection Prevention: Identifier validation on dynamic queries
Path Traversal Prevention: Safe path resolution in file operations

4. Execution Isolation

Docker Sandbox: Container-isolated code execution
WASM Sandbox: wasmtime-based execution with fuel limits
Process Timeout: Hard timeouts prevent runaway processes

5. Data Protection

Vault Encryption: AES-256-GCM for secrets at rest
Auto-Zeroing: Decrypted secrets cleared from memory after TTL
Taint Tracking: Information flow control (Secret, PII, UntrustedAgent)

6. Audit & Compliance

Merkle Audit Chain: SHA-256-linked immutable log
Ed25519 Signing: Cryptographic signatures for manifests
Tamper Detection: Chain integrity verification

Security Configuration

# config.yaml
security:
  rbac:
    enabled: true
    default_deny: true
  
  audit:
    enabled: true
    hmac_key: ${AUDIT_HMAC_KEY}
  
  vault:
    auto_lock_minutes: 30
    min_password_length: 8
  
  sandbox:
    docker:
      enabled: true
      network_mode: none
    wasm:
      default_fuel: 1000000
      default_timeout_secs: 30
      max_memory_pages: 256
  
  rate_limiting:
    enabled: true
    requests_per_minute: 60
  
  prompt_injection:
    enabled: true
    block_threshold: 0.5

Security Workers

AgentOS security is implemented across Rust and TypeScript workers:

Worker	Language	Purpose
`security` (Rust)	Rust	RBAC, audit chain, prompt injection, taint tracking
`security` (TS)	TypeScript	Capability enforcement, audit log
`security-map`	TypeScript	Mutual authentication protocol
`vault`	TypeScript	AES-256-GCM encrypted secrets
`wasm-sandbox`	Rust	WASM execution with fuel limits

Security Workflow

Every agent action flows through multiple security checkpoints:

Capability Check

Verify agent has permission to access the requested tool via RBAC.

Quota Enforcement

Check token usage against per-agent hourly limits.

Input Scanning

Scan input for prompt injection patterns and malicious content.

Rate Limiting

Apply GCRA token bucket rate limiting per agent.

Sandboxed Execution

Execute code in Docker or WASM sandbox with resource limits.

Audit Logging

Append event to Merkle-linked audit chain.

Security Guarantees

While AgentOS implements multiple layers of security, no system is 100% secure. Always:

Rotate credentials regularly
Review audit logs for anomalies
Apply principle of least privilege
Keep dependencies updated

What AgentOS Protects Against

✅ Prompt injection attacks — Regex-based pattern detection
✅ Unauthorized tool access — RBAC capability enforcement
✅ Resource exhaustion — Fuel limits, timeouts, rate limiting
✅ Audit tampering — Merkle-linked immutable chain
✅ Secrets exposure — AES-256-GCM encryption + auto-zeroing
✅ Replay attacks — Single-use nonces with timing verification
✅ Timing attacks — Constant-time HMAC comparison

What You Still Need to Protect

⚠️ Supply chain attacks — Review dependencies and lock versions
⚠️ Social engineering — Train users on security best practices
⚠️ Physical access — Secure infrastructure and credentials
⚠️ Zero-day exploits — Keep systems patched and monitored

CLI Security Commands

# View audit log
agentos security audit

# Verify audit chain integrity
agentos security verify

# Scan text for prompt injection
agentos security scan "user input text"

# Initialize encrypted vault
agentos vault init

# Store a secret
agentos vault set API_KEY sk-xxxxx

# List stored secrets (keys only)
agentos vault list

Next Steps

RBAC Configuration

Configure per-agent capabilities and tool policies

Audit Chain

Learn how to verify audit integrity and detect tampering

Sandboxing

Configure Docker and WASM sandboxes for code execution

Vault Setup

Securely store API keys and credentials

Get Started

Core Concepts

CLI Reference

Tools & Capabilities

Control Plane

Security

Advanced Features

Templates & Examples

Security Overview

Security Architecture

Security Features

RBAC

Audit Chain

Sandboxing

Vault

Fail-Closed Defaults

Multi-Layer Protection

1. Authentication

2. Authorization

3. Input Validation

4. Execution Isolation

5. Data Protection

6. Audit & Compliance

Security Configuration

Security Workers

Security Workflow

Security Guarantees

What AgentOS Protects Against

What You Still Need to Protect

CLI Security Commands

Next Steps

RBAC Configuration

Audit Chain

Sandboxing

Vault Setup

Build docs developers (and LLMs) love

Get Started

Core Concepts

CLI Reference

Tools & Capabilities

Control Plane

Security

Advanced Features

Templates & Examples

​Security Architecture

​Security Features

RBAC

Audit Chain

Sandboxing

Vault

​Fail-Closed Defaults

​Multi-Layer Protection

​1. Authentication

​2. Authorization

​3. Input Validation

​4. Execution Isolation

​5. Data Protection

​6. Audit & Compliance

​Security Configuration

​Security Workers

​Security Workflow

​Security Guarantees

​What AgentOS Protects Against

​What You Still Need to Protect

​CLI Security Commands

​Next Steps

RBAC Configuration

Audit Chain

Sandboxing

Vault Setup

Build docs developers (and LLMs) love

Security Architecture

Security Features

Fail-Closed Defaults

Multi-Layer Protection

1. Authentication

2. Authorization

3. Input Validation

4. Execution Isolation

5. Data Protection

6. Audit & Compliance

Security Configuration

Security Workers

Security Workflow

Security Guarantees

What AgentOS Protects Against

What You Still Need to Protect

CLI Security Commands

Next Steps