Threat model

Last reviewed: 2026-03-03 | Applies to: v0.7.2

Rampart is a policy engine for AI agents — not a sandbox, not a hypervisor, not a full isolation boundary. This document describes what Rampart protects against, what it doesn’t, and why.

What Rampart is

A firewall for AI agent tool calls. It evaluates commands, file operations, and network requests against YAML policies and makes allow/deny/log decisions in microseconds. It’s designed to catch the 95%+ case: an AI agent that hallucinated a dangerous command or got manipulated by a prompt injection.

Primary threat: misbehaving AI agents

Rampart’s target threat is an AI agent that:

Hallucinated a destructive command (rm -rf /, DROP TABLE)
Was manipulated by prompt injection (malicious content in a file or webpage told it to exfiltrate data)
Made a well-intentioned mistake (wrong environment, wrong file, wrong server)
Escalated beyond its intended scope (sub-agent spawning unrestricted tool calls)

These agents aren’t adversarial — they’re confused, manipulated, or wrong. Rampart catches them reliably.

Not the target: adversarial human attackers

Rampart does not claim to stop a skilled human who has already compromised your system. If an attacker has shell access, they can bypass Rampart the same way they’d bypass any userspace tool.

Rampart is one layer in defense-in-depth, not a replacement for OS hardening, network segmentation, or access control.

Trust boundaries

┌─────────────────────────────────────────────┐
│ Trusted                                      │
│  • Policy files (admin-authored YAML)        │
│  • Rampart binary                            │
│  • rampart serve process                     │
│  • Audit log directory (when user-separated) │
│  • Policy registry sources (when verified)   │
│  • HMAC signing key (~/.rampart/signing.key) │
├─────────────────────────────────────────────┤
│ Untrusted                                    │
│  • AI agent tool calls (all input)           │
│  • Agent-generated commands                  │
│  • MCP tool call parameters                  │
│  • Webhook response payloads (validated)     │
│  • Project-local .rampart/policy.yaml files  │
│  • Community policies (verified by SHA-256)  │
└─────────────────────────────────────────────┘

Policy files are the security boundary. If an attacker can modify policy files, Rampart’s guarantees do not hold. This is why user separation is recommended for production.

Known limitations

1. Interpreter bypass

Rampart evaluates the command string passed to the shell. If an agent runs python3 script.py, Rampart sees and evaluates python3 script.py — but cannot inspect what script.py does internally. Mitigations:

LD_PRELOAD cascade (v0.1.9+): rampart preload and rampart wrap intercept child processes spawned by allowed commands. python3 script.py calling os.system("rm -rf /") is caught — the subprocess goes through Rampart’s policy engine.
Interpreter one-liner blocking (v0.6.9+): Patterns like python3 -c, node -e, ruby -e, perl -e with dangerous system calls are blocked by default in standard/paranoid policies.
The optional rampart-verify sidecar uses LLM classification to assess intent of ambiguous commands via action: webhook.

Remaining surface: LD_PRELOAD cascade only applies to wrap/preload modes, not native hooks (Claude Code, Cline). Programs that use native file I/O without shelling out are not covered, and neither are setuid binaries, for which the dynamic linker ignores LD_PRELOAD. Multi-step write-then-execute sequences are evaluated at both steps independently when file tool coverage is enabled.
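As a rough illustration of interpreter one-liner blocking, the sketch below flags commands such as python3 -c or perl -e whose inline code contains a risky call. It is not Rampart's matcher: the function name, the interpreter table, and the list of "dangerous" calls are all hypothetical and chosen only for the example.

```python
import re
import shlex

# Hypothetical table: interpreter name -> flag that introduces inline code.
INTERPRETER_FLAGS = {"python": "-c", "python3": "-c", "node": "-e", "ruby": "-e", "perl": "-e"}

# Hypothetical list of risky calls; a real policy would be far more complete.
DANGEROUS_CALLS = re.compile(r"os\.system|subprocess|child_process|\bsystem\s*\(|\bexec\s*\(")


def is_dangerous_one_liner(command: str) -> bool:
    """Return True if the command is an interpreter one-liner with a risky call."""
    try:
        argv = shlex.split(command)
    except ValueError:
        # Unbalanced quotes: this sketch treats unparseable input as suspicious.
        return True
    if len(argv) < 3:
        return False
    interpreter = argv[0].rsplit("/", 1)[-1]  # tolerate /usr/bin/python3
    flag = INTERPRETER_FLAGS.get(interpreter)
    if flag is None or argv[1] != flag:
        return False
    return bool(DANGEROUS_CALLS.search(argv[2]))
```

Note that python3 script.py passes this check, which is exactly the gap the interpreter bypass limitation describes: the string alone says nothing about what the script does.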

2. Audit log rewrite

The hash-chained audit trail detects partial tampering — editing, inserting, or deleting individual records breaks the chain. However, a complete rewrite from scratch with a new valid chain is not detectable from the log file alone. Mitigations:

Run rampart serve as a separate user so the agent can’t access audit files
Enable SIEM export (--syslog or --cef) to send events to an external immutable system
Webhook notifications send real-time alerts to Discord/Slack — a separate record
External chain anchors every 100 events provide additional integrity checkpoints

For compliance environments, pair with external immutable logging (CloudTrail, Wazuh, etc.) for an independent trust anchor.
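To make the distinction between partial tampering and a full rewrite concrete, here is a minimal hash-chain sketch. The record layout, the GENESIS value, and the function names are assumptions for illustration and do not reflect Rampart's actual audit format.

```python
import hashlib
import json

GENESIS = "0" * 64  # hypothetical starting value for the chain


def record_hash(prev_hash: str, event: dict) -> str:
    """Hash an event together with the hash of the record before it."""
    payload = prev_hash + json.dumps(event, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def build_chain(events: list[dict]) -> list[dict]:
    chain, prev = [], GENESIS
    for event in events:
        digest = record_hash(prev, event)
        chain.append({"event": event, "prev": prev, "hash": digest})
        prev = digest
    return chain


def verify_chain(chain: list[dict]) -> bool:
    prev = GENESIS
    for record in chain:
        if record["prev"] != prev or record["hash"] != record_hash(prev, record["event"]):
            return False
        prev = record["hash"]
    return True


log = build_chain([{"cmd": "ls"}, {"cmd": "rm -rf /tmp/x"}, {"cmd": "git status"}])

# Editing one record in place breaks verification.
edited = [dict(r) for r in log]
edited[1] = {**edited[1], "event": {"cmd": "echo harmless"}}

# Rebuilding the whole log from altered events yields a new chain that verifies.
rewritten = build_chain([{"cmd": "ls"}, {"cmd": "git status"}])

# A hash stored outside the log (an external anchor) exposes the rewrite.
anchor = log[-1]["hash"]
```

In this sketch verify_chain(edited) fails while verify_chain(rewritten) succeeds, and only the comparison against the externally held anchor reveals that the rewritten log is not the original.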

3. Token exposure in wrap mode

The rampart wrap shim needs a bearer token to authenticate with rampart serve. The token is stored in a 0600 file (not embedded in the shim script), but because the shim runs as the agent user, the agent can still read this file. Mitigations:

Run rampart serve as a separate user and store the token in that user’s home directory
Use native hooks (rampart setup claude-code) instead — hooks don’t require a token file
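The following sketch shows what a 0600 mode does and does not provide on POSIX systems: it keeps other users out, but the owning user (here, the agent) retains read access. The file name and token value are placeholders, not Rampart's real paths.

```python
import os
import stat
import tempfile


def is_private(path: str) -> bool:
    """True if neither group nor others have any permission bits set."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    return (mode & 0o077) == 0


with tempfile.TemporaryDirectory() as tmp:
    token_path = os.path.join(tmp, "token")  # placeholder path
    # Create the file with 0600 from the start instead of chmod-ing afterwards.
    fd = os.open(token_path, os.O_WRONLY | os.O_CREAT | os.O_EXCL, 0o600)
    with os.fdopen(fd, "w") as handle:
        handle.write("example-token")
    private_before = is_private(token_path)

    # The owner can still read the file; 0600 only excludes other users.
    with open(token_path) as handle:
        owner_can_read = handle.read() == "example-token"

    os.chmod(token_path, 0o644)
    private_after = is_private(token_path)
```

This is why the mitigations above move the token to a different user's home directory rather than relying on file mode alone.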

4. Encoding and obfuscation

An agent could encode commands to bypass pattern matching. Mitigations (v0.6.9+):

Rampart decodes base64 commands before pattern matching
Leading shell comments and ANSI escape sequences are stripped
Null bytes and control characters are removed
Subcommand extraction: $(cmd), backticks, eval 'cmd' — inner commands are matched independently
Common obfuscation patterns (base64 *, eval *, xxd -r | bash) trigger deny rules in standard policy
The semantic verification sidecar classifies intent regardless of encoding

Coverage: The two-layer approach (pattern matching + LLM classification) significantly reduces the obfuscation surface. v0.6.9 closed 10 specific bypass vectors identified in a security audit.
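A simplified sketch of the normalization steps listed above (ANSI and control-character stripping, subcommand extraction, base64 decoding) might look like this. The regular expressions and function names are illustrative assumptions; Rampart's real decoder is likely to handle more cases, such as nested substitutions and leading shell comments.

```python
import base64
import binascii
import re

ANSI_ESCAPE = re.compile(r"\x1b\[[0-9;?]*[ -/]*[@-~]")
CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b-\x1f\x7f]")  # keeps tab and newline
SUBSHELL = re.compile(r"\$\(([^()]*)\)|`([^`]*)`")        # non-nested only
B64_TOKEN = re.compile(r"\b[A-Za-z0-9+/]{16,}={0,2}")


def normalize(command: str) -> str:
    """Strip ANSI escapes first, then remaining control characters."""
    command = ANSI_ESCAPE.sub("", command)
    command = CONTROL_CHARS.sub("", command)
    return command.strip()


def inner_commands(command: str) -> list[str]:
    """Extract the bodies of $(...) and backtick substitutions."""
    return [a or b for a, b in SUBSHELL.findall(command)]


def decoded_payloads(command: str) -> list[str]:
    """Decode tokens that look like base64 and are valid UTF-8 once decoded."""
    out = []
    for token in B64_TOKEN.findall(command):
        try:
            out.append(base64.b64decode(token, validate=True).decode("utf-8"))
        except (binascii.Error, UnicodeDecodeError):
            continue
    return out


def candidates(command: str) -> list[str]:
    """Every string that pattern matching should be run against."""
    clean = normalize(command)
    return [clean, *inner_commands(clean), *decoded_payloads(clean)]
```

The design point is that each extracted or decoded string is matched independently, so a deny rule for the inner command still applies when that command is wrapped or encoded.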

5. Framework-specific patching

Some agent frameworks (e.g., OpenClaw) don’t expose hook points for file operations. Rampart’s --patch-tools option modifies framework source files to add policy checks. These patches don’t survive framework upgrades. Security implications:

Timing window: Between framework upgrade and re-patch, file tools bypass all policies (exec shim remains active)
Silent degradation: If the patched code changes in a new version, the patches fail open without warning
Re-run required: rampart setup openclaw --patch-tools --force must be re-run after every OpenClaw upgrade

Monkey-patching is fragile but functional. It closes a real security gap today while proper upstream hook support is developed.

6. Fail-open behavior

When rampart serve is unreachable, the shim defaults to fail-open — commands execute without policy checks. This is a deliberate design choice: fail-closed would lock you out of your own machine. Mitigations:

Monitor the Rampart service and alert on downtime
Use systemd/launchd to auto-restart on failure (rampart serve install does this)
Webhook notifications confirm the service is actively evaluating

Fail-open means a brief security gap during outages. Fail-closed means a crashed Rampart bricks your agent (and potentially your system). We chose availability over strict enforcement.
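The trade-off can be sketched as a single branch in the shim. In this hypothetical example, ask_service stands in for whatever call the shim makes to rampart serve, and the fail_open parameter is an illustrative knob rather than a documented setting.

```python
from typing import Callable


def decide(command: str, ask_service: Callable[[str], bool], fail_open: bool = True) -> bool:
    """Return True if the command may run.

    ask_service is a stand-in for the policy query. Connection failures and
    timeouts in Python are OSError subclasses, so one except clause covers both.
    """
    try:
        return ask_service(command)
    except OSError:
        # Service unreachable: fail open allows, fail closed denies.
        return fail_open


def unreachable(_command: str) -> bool:
    raise ConnectionRefusedError("rampart serve is down")
```

With fail_open=True, decide("ls", unreachable) allows the command; with fail_open=False the same outage blocks every command, which is the lock-out scenario described above.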

7. Regex complexity limits

Rampart imposes limits on regex patterns used for response matching to prevent ReDoS. Current limits:

Maximum pattern length: 500 characters
Nested quantifiers: Rejected at load time (patterns like (a+)*)
Execution timeout: 100ms per regex match
Response cap: 1MB maximum for response-side evaluation

These limits protect against both accidental performance degradation and malicious patterns.
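A load-time check along these lines could be sketched as follows. The nested-quantifier detection is a deliberately crude heuristic written for this example, not Rampart's actual check, and interpreting "1MB" as 1 MiB is an assumption. Python's standard re module has no per-match timeout, so the 100ms limit is not shown.

```python
import re

MAX_PATTERN_LENGTH = 500          # limit stated above
MAX_RESPONSE_BYTES = 1024 * 1024  # assumes "1MB" means 1 MiB

# Heuristic: a group that contains + or * and is itself followed by a quantifier.
# It ignores escaping, so a pattern like \(a+\)* is a false positive.
NESTED_QUANTIFIER = re.compile(r"\([^()]*[+*][^()]*\)[+*{]")


def validate_pattern(pattern: str) -> re.Pattern:
    """Reject over-long or nested-quantifier patterns before compiling."""
    if len(pattern) > MAX_PATTERN_LENGTH:
        raise ValueError("pattern exceeds maximum length")
    if NESTED_QUANTIFIER.search(pattern):
        raise ValueError("nested quantifier rejected")
    try:
        return re.compile(pattern)
    except re.error as exc:
        raise ValueError(f"invalid pattern: {exc}") from exc


def cap_response(body: bytes) -> bytes:
    """Only the first MAX_RESPONSE_BYTES bytes are scanned."""
    return body[:MAX_RESPONSE_BYTES]
```

Checking length before running the heuristic keeps the validator itself cheap, since the heuristic only ever sees inputs of at most 500 characters.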

8. No TLS on HTTP API

rampart serve communicates over plaintext HTTP. On localhost this is acceptable; in remote or team deployments, policy decisions cross the network unencrypted. Mitigations:

Default bind is 127.0.0.1 (localhost only)
For remote access, use a reverse proxy with TLS or SSH tunnel
TLS support for rampart serve is planned for a future release

9. In-memory approval store

Pending approvals are stored in memory and lost on service restart. If rampart serve restarts while an approval is pending, the requesting agent receives a timeout/denial. Mitigations:

Approvals typically resolve within seconds (human clicks approve/deny)
Service restarts are rare during active sessions
Persistent approval storage is planned for a future release

10. Project policy trust

Project-local .rampart/policy.yaml files are loaded automatically when present. A malicious repository could include a permissive project policy. Mitigations (v0.6.9+):

Project policies can only add restrictions, not weaken global policies (deny-wins)
Set RAMPART_NO_PROJECT_POLICY=1 to skip project policy loading in untrusted repos
Project policy denials are prefixed with [Project Policy] for visibility
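The deny-wins rule can be illustrated with a small merge function. The decision strings and function names are assumptions made for the example; only the RAMPART_NO_PROJECT_POLICY variable comes from the text above.

```python
from typing import Mapping, Optional


def project_policy_enabled(env: Mapping[str, str]) -> bool:
    """Project policies are skipped when RAMPART_NO_PROJECT_POLICY=1."""
    return env.get("RAMPART_NO_PROJECT_POLICY") != "1"


def combine(global_decision: str, project_decision: Optional[str]) -> str:
    """Merge decisions so that a project policy can only add restrictions.

    project_decision is None when no project rule matched.
    """
    if global_decision == "deny" or project_decision == "deny":
        return "deny"
    # A project-level "allow" never overrides the global outcome.
    return global_decision
```

Under this rule a permissive policy shipped in a malicious repository has no effect: combine("deny", "allow") is still "deny", while combine("allow", "deny") tightens the result to "deny".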

11. Community policy supply chain

rampart policy fetch downloads policies from the registry with SHA-256 verification. However, the registry itself is hosted in the main repo, so a compromise of the repository could introduce malicious policies along with matching checksums. Mitigations:

SHA-256 verification prevents modification after registry publication
--dry-run flag allows inspection before installation
Policy linting (rampart policy lint) validates syntax and flags suspicious patterns
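For reference, checksum verification of a downloaded policy reduces to a few lines. This is a generic sketch using Python's standard library; the function name is hypothetical and the way Rampart obtains the expected digest is not shown.

```python
import hashlib
import hmac


def verify_policy(content: bytes, expected_sha256: str) -> bool:
    """Compare the SHA-256 of the downloaded bytes with the published digest."""
    actual = hashlib.sha256(content).hexdigest()
    # compare_digest avoids leaking the mismatch position through timing.
    return hmac.compare_digest(actual, expected_sha256.strip().lower())
```

The check only proves that the bytes match the digest. If the digest and the policy come from the same compromised source, both can be replaced together, which is the supply-chain limitation described above.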

Integration-specific coverage

Integration	Exec Coverage	File Coverage	Response Scanning	LD_PRELOAD Cascade
Native hooks (Claude Code)	✅	✅ (via hooks)	✅ PostToolUse	❌
Native hooks (Cline)	✅	✅ (via hooks)	❌	❌
`rampart wrap`	✅	❌	❌	✅ LD_PRELOAD
`rampart preload`	✅	❌	❌	✅ LD_PRELOAD
`rampart setup openclaw --patch-tools`	✅ (shim)	✅ (patched)	❌	❌
HTTP proxy	✅	✅	✅	❌
MCP proxy	✅	✅	✅	❌

Platform notes: macOS

v0.4.4 added 17 macOS-specific built-in policies covering Keychain access, Gatekeeper bypass, persistence mechanisms, user management, and AppleScript shell execution. These are active automatically in the standard and paranoid profiles.

Platform notes: Windows

v0.6.6 added Windows policy parity. Key differences from Linux/macOS:

No LD_PRELOAD — rampart preload is not available. Use native hooks or wrap mode instead.
No POSIX file permissions — chmod 0600 is not enforced by the OS. Token files and signing keys are created with default permissions; use Windows ACLs for hardening.
Binary upgrade — Windows forbids overwriting a running executable. rampart upgrade renames the current binary to .rampart.exe.old first, then installs the new one.
Path separators — Rampart normalizes backslashes to forward slashes internally for consistent policy matching.
Service management — rampart serve install creates a Windows service (not systemd/launchd). Auto-restart is configured by default.
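The path separator normalization mentioned above can be sketched as follows. The glob-style pattern syntax is an assumption for illustration and may differ from what Rampart policies actually accept.

```python
from fnmatch import fnmatchcase


def normalize_path(path: str) -> str:
    """Convert Windows backslashes to forward slashes."""
    return path.replace("\\", "/")


def path_matches(pattern: str, path: str) -> bool:
    """Match a forward-slash pattern against a path in either style."""
    return fnmatchcase(normalize_path(path), pattern)
```

Normalizing before matching lets a single pattern such as C:/Users/*/.ssh/* cover both C:\Users\dev\.ssh\id_rsa and its forward-slash form.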

Deployment recommendations

Setup	Agent reads audit?	Agent modifies policy?	Best for
Same user (default)	✅ Yes	✅ Yes	Development, testing
Separate user	❌ No	❌ No	Production, unsupervised agents
Separate user + SIEM	❌ No	❌ No	Enterprise, compliance

Prerequisite: The agent must run as a non-root user. If the agent runs as root, user separation provides no protection.

Sudo caveat: An agent with unrestricted sudo (NOPASSWD: ALL) can bypass user separation. Restrict sudo to specific commands your agent needs rather than granting blanket access.

Philosophy

Rampart is a seatbelt, not a roll cage. It catches the vast majority of dangerous situations an AI agent will encounter — accidental or manipulated. It doesn’t claim to stop every possible attack vector, and we’re honest about what falls outside its scope.

If you need full isolation, use a sandbox (container, VM, or a tool like nono). Rampart and sandboxes are complementary — use both for defense in depth.

Reporting security issues

If you’ve found a vulnerability not covered here, please email [email protected]. We’ll acknowledge within 48 hours and work with you on coordinated disclosure.

Please do not open public issues for security vulnerabilities.
