
ANY /proxy/openai/*

Proxy requests to OpenAI API with automatic credential injection, policy enforcement, prompt drift detection, and cost tracking. All requests are logged in the audit trail.

Authentication

Does not require session authentication. The proxy does not use session tokens; instead, configure your OpenAI API key in the Fishnet credential vault, and the proxy automatically injects it into upstream requests.

Endpoint Format

POST /proxy/openai/v1/chat/completions
GET /proxy/openai/v1/models
POST /proxy/openai/v1/embeddings
# Any OpenAI API endpoint
The path after /proxy/openai is forwarded to the OpenAI API (default: https://api.openai.com).

Request Flow

  1. Rate limiting (if configured): Check llm.rate_limit_per_minute
  2. LLM guards (if enabled):
    • Model allowlist check (llm.allowed_models)
    • Prompt drift detection (llm.prompt_drift)
    • Prompt size limits (llm.prompt_size_guard)
  3. Credential injection: Retrieve OpenAI API key from vault and add Authorization: Bearer sk-... header
  4. Upstream request: Forward to OpenAI API
  5. Cost tracking (if enabled): Parse usage and record cost based on llm.model_pricing
  6. Audit logging: Record decision, cost, and cryptographic proof in audit log
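The guard steps above (2) can be sketched as a small policy check. This is a hypothetical illustration, not Fishnet's actual internals: the `PolicyError` class, `evaluate_guards` function, and the flattened `allowed_models`/`max_prompt_chars` keys are all assumed names, but the error strings mirror the documented responses.

```python
# Hypothetical sketch of the LLM guard checks; names are illustrative,
# not Fishnet's actual internals.

class PolicyError(Exception):
    """Raised when a request violates the configured policy."""

def evaluate_guards(policy: dict, body: dict) -> str:
    # Model allowlist check (case-insensitive); an empty list allows all.
    allowed = [m.lower() for m in policy.get("allowed_models", [])]
    model = body.get("model", "")
    if allowed and model.lower() not in allowed:
        raise PolicyError(f"model not in allowlist: {model}")

    # Prompt size guard: the limit is counted in characters (approximate).
    limit = policy.get("max_prompt_chars", 0)
    size = sum(len(str(m.get("content", ""))) for m in body.get("messages", []))
    if limit and size > limit:
        raise PolicyError(f"Prompt size {size} chars exceeds limit of {limit}")

    return "approved"
```

A request that passes every guard proceeds to credential injection; a failed guard produces the 403 responses shown under Error Responses.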

Policy Enforcement

llm.rate_limit_per_minute (integer, default: 0)
  Maximum requests per minute across all LLM providers. 0 = disabled.
llm.allowed_models (array, default: [])
  List of allowed model names (case-insensitive). Empty = all models allowed. Example: ["gpt-4o", "gpt-4o-mini"]
llm.prompt_drift.enabled (boolean, default: true)
  Enable prompt drift detection. Records a baseline system prompt and alerts on changes.
llm.prompt_drift.mode (string, default: "alert")
  Action when drift is detected: alert (log a warning) or deny (block the request).
llm.prompt_size_guard.enabled (boolean, default: false)
  Enable prompt size limiting.
llm.prompt_size_guard.max_prompt_tokens (integer, default: 0)
  Maximum total prompt size, counted in characters as an approximate stand-in for tokens. 0 = no limit.
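Putting the settings above together, a policy section might look like the following. This is a sketch in the same TOML format used for cost tracking; whether these keys nest exactly this way in your Fishnet version is an assumption, so check your configuration reference.

```toml
[llm]
rate_limit_per_minute = 60
allowed_models = ["gpt-4o", "gpt-4o-mini"]

[llm.prompt_drift]
enabled = true
mode = "deny"

[llm.prompt_size_guard]
enabled = true
max_prompt_tokens = 8000
```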

Cost Tracking

Fishnet tracks token usage and calculates costs based on model pricing:
[llm]
track_spend = true

[llm.model_pricing]
gpt-4o.input_per_million_usd = 2.50
gpt-4o.output_per_million_usd = 10.00
gpt-4o-mini.input_per_million_usd = 0.15
gpt-4o-mini.output_per_million_usd = 0.60
For streaming requests to /v1/chat/completions, Fishnet automatically injects stream_options.include_usage: true to ensure usage data is included in the stream.
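Given the pricing table above, the recorded cost follows from the usage block in OpenAI's response at the standard per-million-token rates. A minimal sketch (the `PRICING` dict mirrors the `[llm.model_pricing]` keys; exact rounding in Fishnet's own accounting is not specified here):

```python
# Compute request cost from an OpenAI-style usage block and per-million
# pricing, mirroring the [llm.model_pricing] keys above.

PRICING = {
    "gpt-4o-mini": {"input_per_million_usd": 0.15, "output_per_million_usd": 0.60},
}

def cost_usd(model: str, usage: dict) -> float:
    p = PRICING[model]
    return (usage["prompt_tokens"] / 1_000_000 * p["input_per_million_usd"]
            + usage["completion_tokens"] / 1_000_000 * p["output_per_million_usd"])

usage = {"prompt_tokens": 1000, "completion_tokens": 500}
print(round(cost_usd("gpt-4o-mini", usage), 6))  # 0.00045
```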

Header Forwarding

All request headers are forwarded to OpenAI except:
  • authorization (replaced with vault credential)
  • x-api-key (stripped)
  • host, connection, keep-alive, transfer-encoding, content-length (HTTP infrastructure)
You can include custom headers like X-Request-ID for tracing.
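The forwarding rule above amounts to a filter plus credential injection. A sketch, with the stripped-header set taken directly from the documented exclusions (`forward_headers` is an illustrative name, not Fishnet's internal function):

```python
# Headers stripped before forwarding, per the documented exclusions.
STRIPPED = {"authorization", "x-api-key", "host", "connection",
            "keep-alive", "transfer-encoding", "content-length"}

def forward_headers(incoming: dict, api_key: str) -> dict:
    # Drop stripped headers (case-insensitively), keep everything else.
    out = {k: v for k, v in incoming.items() if k.lower() not in STRIPPED}
    # Inject the vault credential in place of any client-sent Authorization.
    out["Authorization"] = f"Bearer {api_key}"
    return out
```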

Body Forwarding

Request bodies are forwarded as-is to OpenAI. For JSON requests, Fishnet parses the body to:
  • Extract the model field for allowlist checking
  • Extract the stream field to detect streaming requests
  • Extract system prompts for drift detection
  • Count total characters for size limits
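The fields listed above can be extracted with ordinary JSON parsing; the key names follow the OpenAI request schema. A hypothetical sketch (`inspect_body` is an illustrative name):

```python
import json

def inspect_body(raw: bytes) -> dict:
    # A parse failure here is what maps to the 400 "not valid JSON" error.
    body = json.loads(raw)
    messages = body.get("messages", [])
    return {
        "model": body.get("model"),                       # allowlist check
        "stream": bool(body.get("stream", False)),        # streaming detection
        "system_prompt": next((m["content"] for m in messages
                               if m.get("role") == "system"), None),  # drift
        "prompt_chars": sum(len(str(m.get("content", "")))
                            for m in messages),           # size limit
    }
```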

Examples

curl -X POST http://localhost:3080/proxy/openai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user", "content": "What is Fishnet?"}
    ]
  }'

Streaming Example

import requests

url = "http://localhost:3080/proxy/openai/v1/chat/completions"
payload = {
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "Count to 5"}],
    "stream": True
}

response = requests.post(url, json=payload, stream=True)

for line in response.iter_lines():
    if line:
        line = line.decode('utf-8')
        if line.startswith('data: '):
            data = line[6:]
            if data != '[DONE]':
                print(data)
Fishnet automatically adds stream_options.include_usage: true to chat completion requests to track token usage in streams.

Error Responses

400 Bad Request
{"error": "request body is not valid JSON"}
Invalid JSON body when Content-Type is application/json.

403 Forbidden
{"error": "model not in allowlist: gpt-4"}
Model not in the llm.allowed_models list.

403 Forbidden
{"error": "System prompt drift detected: ..."}
Prompt drift detected and llm.prompt_drift.mode = deny.

403 Forbidden
{"error": "Prompt size 5000 chars exceeds limit of 1000"}
Prompt exceeds llm.prompt_size_guard.max_prompt_tokens when the action is deny.

429 Too Many Requests
{
  "error": "rate limit exceeded, retry after 42s",
  "retry_after_seconds": 42
}
Rate limit exceeded (configured via llm.rate_limit_per_minute).

502 Bad Gateway
{"error": "upstream provider is unavailable"}
Failed to connect to the OpenAI API.
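On the client side, the documented 429 body makes backoff straightforward because it carries retry_after_seconds. A hypothetical handling sketch; the `post` parameter stands in for a callable like `requests.post` (injected here so the logic is testable without a network):

```python
import time

def post_with_retry(post, url, payload, max_retries=3, sleep=time.sleep):
    """post: a callable like requests.post; sleep is injectable for tests."""
    for _ in range(max_retries):
        resp = post(url, json=payload)
        if resp.status_code == 429:
            # Back off using the documented retry_after_seconds field.
            sleep(resp.json().get("retry_after_seconds", 1))
            continue
        if resp.status_code == 403:
            # Policy denial (allowlist, drift, or size guard) is not retryable.
            raise PermissionError(resp.json().get("error", "blocked by policy"))
        resp.raise_for_status()
        return resp.json()
    raise RuntimeError("rate limited after retries")
```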

Audit Log Entry

Each proxied request creates an audit log entry:
{
  "id": 42,
  "timestamp": 1709510400000,
  "intent_type": "api_call",
  "service": "openai",
  "action": "POST /v1/chat/completions",
  "decision": "approved",
  "reason": null,
  "cost_usd": 0.000123,
  "policy_version_hash": "a1b2c3...",
  "intent_hash": "123456...",
  "permit_hash": null,
  "merkle_root": "fedcba..."
}
Retrieve via GET /api/audit?service=openai.
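Since each entry carries cost_usd, the audit endpoint can double as a spend report. A hypothetical snippet; the session-token header matches the Configuration example below, and the assumption that the response body is a flat JSON list of entries is mine, not confirmed by this page:

```python
import requests

def total_spend(entries):
    # Sum cost_usd across audit entries, treating null as zero.
    return sum(e.get("cost_usd") or 0 for e in entries)

def fetch_openai_audit(base="http://localhost:3080", token="fn_sess_YOUR_TOKEN"):
    resp = requests.get(f"{base}/api/audit",
                        params={"service": "openai"},
                        headers={"Authorization": f"Bearer {token}"})
    resp.raise_for_status()
    return resp.json()  # assumed: a list of audit entries
```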

Configuration

Add your OpenAI API key to the credential vault:
# Via Dashboard: Credentials > Add Credential
# Service: openai
# Name: (any name, e.g., "primary")
# Key: sk-...
Or via API:
curl -X POST http://localhost:3080/api/credentials \
  -H "Authorization: Bearer fn_sess_YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "service": "openai",
    "name": "primary",
    "key": "sk-..."
  }'
