REST API Overview

The AgentOS REST API provides HTTP endpoints for managing agents, workflows, and chat completions. The API is built on the iii-engine and runs on port 3111 by default.

Base URL

The default base URL for all API endpoints is:

http://localhost:3111

You can configure the port and host in config.yaml:

modules:
  - class: modules::api::RestApiModule
    config:
      port: 3111
      host: 0.0.0.0
      default_timeout: 300000
      concurrency_request_limit: 2048

Authentication

All API endpoints require authentication via API keys. Include your API key in the request headers:

Authorization: Bearer YOUR_API_KEY

To set an API key for a provider:

agentos config set-key anthropic $ANTHROPIC_API_KEY

Rate Limiting

The API implements GCRA (Generic Cell Rate Algorithm) rate limiting per IP address:

Read operations: 200 requests/hour
Write operations: 100 requests/hour
Message operations: 60 requests/hour
Workflow runs: 100 requests/hour

When rate limited, you’ll receive a 429 status code with a retryAfter value in seconds.

Security Headers

All responses include security headers:

X-Content-Type-Options: nosniff
X-Frame-Options: DENY
X-XSS-Protection: 1; mode=block
Strict-Transport-Security: max-age=31536000; includeSubDomains

CORS Configuration

CORS is enabled by default with the following settings:

cors:
  allowed_origins: ['*']
  allowed_methods: [GET, POST, PUT, DELETE, OPTIONS]
  allowed_headers: ['*']

Available Endpoints

OpenAI Compatibility

Chat Completions

OpenAI-compatible chat completions endpoint

Agent Management

Agent Endpoints

Create, list, get, delete, and message agents

Workflow Execution

Workflow Endpoints

Create, list, run workflows and get run status

Health Check

Check the API health status:

GET /api/health

Response:

{
  "status": "healthy",
  "version": "0.1.0",
  "workers": 15,
  "uptime": 3600.5
}

Error Handling

The API returns standard HTTP status codes:

200 OK - Request succeeded
201 Created - Resource created successfully
204 No Content - Request succeeded with no response body
401 Unauthorized - Missing or invalid authentication
404 Not Found - Resource not found
429 Too Many Requests - Rate limit exceeded
500 Internal Server Error - Server error

Error Response Format:

{
  "error": "Error message",
  "retryAfter": 60
}

Observability

The API includes OpenTelemetry metrics:

api_request_duration_ms - Request duration histogram
function_error_total - Error counter by function
Request counts by path, method, and status code

Metrics can be exported via the configured exporter in config.yaml.

Rust API

TypeScript API

REST API

LLM Providers

REST API Overview

Base URL

Authentication

Rate Limiting

Security Headers

CORS Configuration

Available Endpoints

OpenAI Compatibility

Chat Completions

Agent Management

Agent Endpoints

Workflow Execution

Workflow Endpoints

Health Check

Error Handling

Observability

Build docs developers (and LLMs) love

Rust API

TypeScript API

REST API

LLM Providers

​Base URL

​Authentication

​Rate Limiting

​Security Headers

​CORS Configuration

​Available Endpoints

​OpenAI Compatibility

Chat Completions

​Agent Management

Agent Endpoints

​Workflow Execution

Workflow Endpoints

​Health Check

​Error Handling

​Observability

Build docs developers (and LLMs) love

Base URL

Authentication

Rate Limiting

Security Headers

CORS Configuration

Available Endpoints

OpenAI Compatibility

Agent Management

Workflow Execution

Health Check

Error Handling

Observability