Advanced Configuration

Overview

Advanced configuration options allow fine-tuning of the CLI Proxy API behavior including network proxies, request payload manipulation, custom headers, retry strategies, and streaming optimizations.

Proxy Configuration

proxy-url

string

default:"(empty)"

Global proxy URL for outbound requests. Supports socks5, http, and https protocols.Format:

protocol://[user:pass@]host:port[/]

Examples:

proxy-url: "socks5://192.168.1.1:1080"
proxy-url: "http://proxy.company.com:8080"
proxy-url: "socks5://user:[email protected]:1080/"

Per-Provider Proxy Override

Each provider supports per-credential proxy configuration:

# Global default
proxy-url: "socks5://global-proxy:1080"

# Per-provider override
gemini-api-key:
  - api-key: "key1"
    proxy-url: "socks5://gemini-proxy:1080"  # Override for this key

claude-api-key:
  - api-key: "key2"
    proxy-url: ""  # Disable proxy for this key

codex-api-key:
  - api-key: "key3"
    # Uses global proxy-url

Custom Headers

Add custom HTTP headers to upstream provider requests.

<provider>[].headers

object

Custom headers for provider requests. Applied to all requests made with this credential.Example:

gemini-api-key:
  - api-key: "AIzaSy..."
    headers:
      X-Custom-Header: "custom-value"
      X-Team-ID: "team-alpha"

Supported Providers

All API key providers support custom headers:

gemini-api-key
claude-api-key
codex-api-key
openai-compatibility
vertex-api-key

Headers Example

claude-api-key:
  - api-key: "sk-atSM..."
    base-url: "https://api.anthropic.com"
    headers:
      X-Custom-Header: "custom-value"
      X-Request-Source: "internal-proxy"
      Cookie: "session=abc123"

openai-compatibility:
  - name: "custom-provider"
    base-url: "https://api.provider.com/v1"
    api-key-entries:
      - api-key: "provider-key"
    headers:
      X-Provider-ID: "prod-001"
      User-Agent: "CustomClient/1.0"
    models:
      - name: "model-1"
        alias: "m1"

Passthrough Headers

passthrough-headers

boolean

default:"false"

When true, forward filtered upstream response headers to downstream clients.Forwarded headers include:

Content-Type
X-Request-ID
Custom provider headers

Example:

passthrough-headers: true

Security-sensitive headers (Authorization, Set-Cookie) are always filtered and never forwarded.

Retry Configuration

request-retry

integer

default:"3"

Number of times to retry a request. Retries occur if HTTP response code is 403, 408, 500, 502, 503, or 504.Example:

request-retry: 5

max-retry-credentials

integer

default:"0"

Maximum number of different credentials to try for one failed request.Values:

0 - Try all available credentials (legacy behavior)
Positive integer - Limit credential rotation

Example:

max-retry-credentials: 3  # Try up to 3 different credentials

max-retry-interval

integer

default:"30"

Maximum wait time in seconds for a cooled-down credential before triggering a retry.Example:

max-retry-interval: 60  # Wait up to 60 seconds

Quota Exceeded Behavior

quota-exceeded

object

Behavior when API quota limits are exceeded.

Show Quota Properties

quota-exceeded.switch-project

boolean

default:"true"

Whether to automatically switch to another project when quota is exceeded.

quota-exceeded.switch-preview-model

boolean

default:"true"

Whether to automatically switch to a preview model when quota is exceeded.

Retry Example

# Retry up to 5 times
request-retry: 5

# Try up to 3 different credentials
max-retry-credentials: 3

# Wait up to 60 seconds for cooldown
max-retry-interval: 60

# Quota exceeded behavior
quota-exceeded:
  switch-project: true
  switch-preview-model: true

Routing Strategy

routing.strategy

string

default:"round-robin"

Credential selection strategy when multiple credentials match.Supported values:

"round-robin" - Distribute requests evenly across all credentials
"fill-first" - Use first credential until quota exhausted, then next

Example:

routing:
  strategy: "round-robin"

Streaming Configuration

streaming

object

Server-side streaming behavior (SSE keep-alives and safe bootstrap retries).

Show Streaming Properties

streaming.keepalive-seconds

integer

default:"0"

How often the server emits SSE heartbeats (: keep-alive\n\n).Values:

0 - Disabled (default)
Positive integer - Keep-alive interval in seconds

Set to 15-30 seconds to prevent idle connection timeouts from proxies and load balancers.

streaming.bootstrap-retries

integer

default:"0"

How many times the server may retry a streaming request before any bytes are sent.Values:

0 - Disabled (default)
Positive integer - Number of retries

Bootstrap retries allow auth rotation and transient recovery before the client sees any response.

nonstream-keepalive-interval

integer

default:"0"

Emit blank lines every N seconds for non-streaming responses to prevent idle timeouts.Values:

0 - Disabled (default)
Positive integer - Interval in seconds

Example:

nonstream-keepalive-interval: 30

Streaming Example

streaming:
  keepalive-seconds: 15    # SSE heartbeat every 15 seconds
  bootstrap-retries: 1     # Retry once before sending response

nonstream-keepalive-interval: 30  # Keep-alive for non-streaming

Payload Rules

Payload rules allow modification of request payloads before forwarding to upstream providers.

payload

object

Default and override parameter rules applied to provider payloads.

Show Rule Types

payload.default

array

Rules that only set parameters when they are missing in the payload.

payload.default-raw

array

Rules that set raw JSON values only when missing (must be valid JSON).

payload.override

array

Rules that always set parameters, overwriting any existing values.

payload.override-raw

array

Rules that always set raw JSON values, overwriting existing values (must be valid JSON).

payload.filter

array

Rules that remove parameters from the payload by JSON path.

Payload Rule Structure

Each rule contains:

models - List of model patterns and protocol constraints
params - JSON path → value mappings (or array of paths for filter rules)

Payload Examples

Default Rules (Set if Missing)

payload:
  default:
    - models:
        - name: "gemini-2.5-pro"  # Exact match
          protocol: "gemini"       # Restrict to Gemini protocol
      params:
        "generationConfig.thinkingConfig.thinkingBudget": 32768
    
    - models:
        - name: "gemini-*"         # Wildcard match
          protocol: "gemini"
      params:
        "generationConfig.temperature": 0.7

Default Raw Rules (Raw JSON)

payload:
  default-raw:
    - models:
        - name: "gemini-2.5-pro"
          protocol: "gemini"
      params:
        "generationConfig.responseJsonSchema": '{"type":"object","properties":{"answer":{"type":"string"}}}'

Raw JSON values must be valid JSON strings. Invalid JSON will cause the rule to be dropped on startup.

Override Rules (Always Set)

payload:
  override:
    - models:
        - name: "gpt-*"            # Wildcard for all GPT models
          protocol: "codex"        # OpenAI/Codex protocol
      params:
        "reasoning.effort": "high"
        "temperature": 0.5

Override Raw Rules

payload:
  override-raw:
    - models:
        - name: "gpt-*"
          protocol: "codex"
      params:
        "response_format": '{"type":"json_schema","json_schema":{"name":"answer","schema":{"type":"object"}}}'

Filter Rules (Remove Parameters)

payload:
  filter:
    - models:
        - name: "gemini-2.5-pro"
          protocol: "gemini"
      params:  # List of JSON paths to remove
        - "generationConfig.thinkingConfig.thinkingBudget"
        - "generationConfig.responseJsonSchema"

Model Patterns

Supports wildcard matching:

"gpt-*" - Prefix match (gpt-4, gpt-5, etc.)
"*-mini" - Suffix match (gpt-4-mini, claude-3-haiku-mini, etc.)
"*codex*" - Substring match (gpt-5-codex, codex-turbo, etc.)
"gemini-2.5-pro" - Exact match

Protocol Constraints

Supported protocols:

"openai" - OpenAI Chat Completions format
"gemini" - Google Gemini format
"claude" - Anthropic Claude format
"codex" - OpenAI Codex format
"antigravity" - Gemini 3.0 experimental

WebSocket Authentication

ws-auth

boolean

default:"false"

When true, enable authentication for the WebSocket API (/v1/ws).Example:

ws-auth: true

WebSocket authentication uses the same API keys as HTTP requests.

Complete Advanced Configuration Example

# Global Proxy
proxy-url: "socks5://192.168.1.1:1080"

# Headers Passthrough
passthrough-headers: true

# Retry Configuration
request-retry: 5
max-retry-credentials: 3
max-retry-interval: 60

quota-exceeded:
  switch-project: true
  switch-preview-model: true

# Routing Strategy
routing:
  strategy: "round-robin"

# Streaming
streaming:
  keepalive-seconds: 15
  bootstrap-retries: 1

nonstream-keepalive-interval: 30

# WebSocket Auth
ws-auth: true

# Payload Rules
payload:
  # Set defaults if missing
  default:
    - models:
        - name: "gemini-*"
          protocol: "gemini"
      params:
        "generationConfig.temperature": 0.7
  
  # Override always
  override:
    - models:
        - name: "gpt-*"
          protocol: "codex"
      params:
        "reasoning.effort": "high"
  
  # Remove parameters
  filter:
    - models:
        - name: "claude-*"
          protocol: "claude"
      params:
        - "metadata.user_id"

# Provider-Specific Settings
gemini-api-key:
  - api-key: "AIzaSy..."
    proxy-url: "socks5://gemini-proxy:1080"  # Override global
    headers:
      X-Custom-Header: "custom-value"

claude-api-key:
  - api-key: "sk-atSM..."
    headers:
      X-Team-ID: "team-alpha"

Best Practices

Proxy Configuration:

Use global proxy-url for most cases
Override per-provider only when necessary
Test proxy connectivity before deployment
Use authentication for proxy servers in production

Payload Rules:

Test payload rules with actual requests
Use default rules for optional parameters
Use override rules sparingly (may break client expectations)
Validate raw JSON strings before deployment
Order matters: filter rules run before defaults and overrides

Retry Strategy:

Higher request-retry increases reliability but adds latency
Set max-retry-credentials to prevent exhausting all keys on persistent failures
Adjust max-retry-interval based on your quota reset periods

Troubleshooting

Proxy Not Working

Verify proxy URL format (protocol://host:port)
Test proxy connectivity outside the application
Check firewall rules allow proxy connections
Verify authentication credentials if required

Payload Rules Not Applied

Check model name pattern matches exactly (case-sensitive)
Verify protocol constraint matches provider
Validate raw JSON strings are well-formed
Check server logs for rule validation errors

Headers Not Forwarded

Ensure passthrough-headers: true
Check upstream provider actually sends the headers
Verify headers are not security-sensitive (filtered automatically)

Excessive Retries

Lower request-retry value
Set max-retry-credentials to limit credential rotation
Check upstream provider status (may be down)
Review quota limits and cooldown periods

Get Started

Core Concepts

Configuration

OAuth Authentication

Integrations

Deployment

Overview

Proxy Configuration

Per-Provider Proxy Override

Custom Headers

Supported Providers

Headers Example

Passthrough Headers

Retry Configuration

Quota Exceeded Behavior

Retry Example

Routing Strategy

Streaming Configuration

Streaming Example

Payload Rules

Payload Rule Structure

Payload Examples

Default Rules (Set if Missing)

Default Raw Rules (Raw JSON)

Override Rules (Always Set)

Override Raw Rules

Filter Rules (Remove Parameters)

Model Patterns

Protocol Constraints

WebSocket Authentication

Complete Advanced Configuration Example

Best Practices

Troubleshooting

Proxy Not Working

Payload Rules Not Applied

Headers Not Forwarded

Excessive Retries

Build docs developers (and LLMs) love

Get Started

Core Concepts

Configuration

OAuth Authentication

Integrations

Deployment

​Overview

​Proxy Configuration

​Per-Provider Proxy Override

​Custom Headers

​Supported Providers

​Headers Example

​Passthrough Headers

​Retry Configuration

​Quota Exceeded Behavior

​Retry Example

​Routing Strategy

​Streaming Configuration

​Streaming Example

​Payload Rules

​Payload Rule Structure

​Payload Examples

​Default Rules (Set if Missing)

​Default Raw Rules (Raw JSON)

​Override Rules (Always Set)

​Override Raw Rules

​Filter Rules (Remove Parameters)

​Model Patterns

​Protocol Constraints

​WebSocket Authentication

​Complete Advanced Configuration Example

​Best Practices

​Troubleshooting

​Proxy Not Working

​Payload Rules Not Applied

​Headers Not Forwarded

​Excessive Retries

Build docs developers (and LLMs) love

Overview

Proxy Configuration

Per-Provider Proxy Override

Custom Headers

Supported Providers

Headers Example

Passthrough Headers

Retry Configuration

Quota Exceeded Behavior

Retry Example

Routing Strategy

Streaming Configuration

Streaming Example

Payload Rules

Payload Rule Structure

Payload Examples

Default Rules (Set if Missing)

Default Raw Rules (Raw JSON)

Override Rules (Always Set)

Override Raw Rules

Filter Rules (Remove Parameters)

Model Patterns

Protocol Constraints

WebSocket Authentication

Complete Advanced Configuration Example

Best Practices

Troubleshooting

Proxy Not Working

Payload Rules Not Applied

Headers Not Forwarded

Excessive Retries