This guide helps you diagnose and resolve common issues with Codex-LB.

Authentication Issues

Invalid API Key

Error:
{
  "error": {
    "code": "invalid_api_key",
    "message": "Invalid API key",
    "type": "invalid_request_error"
  }
}
HTTP Status: 401 Unauthorized
Common Causes:
  1. Missing Authorization header
    # Wrong
    curl https://your-instance.com/v1/chat/completions
    
    # Correct
    curl https://your-instance.com/v1/chat/completions \
      -H "Authorization: Bearer sk-clb-your-key"
    
  2. Incorrect key format
    • API keys must start with sk-clb-
    • Verify you’re not using a ChatGPT API key by mistake
  3. Key has been deleted or deactivated
    • Check the API Keys page in the dashboard
    • Verify the key’s is_active status is true
  4. Key has expired
    • Check the expires_at field in the dashboard
    • Create a new key or extend the expiration
Solution:
1. Verify Key Format
Ensure your key starts with sk-clb- and is the full token (not just the prefix).
2. Check Key Status
In the dashboard, verify:
  • Key exists in the API Keys list
  • is_active is true
  • expires_at is in the future (or null)
3. Test with New Key
Create a new API key and test with it to rule out key-specific issues.
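Before sending requests, you can also sanity-check the key shape locally. This is a sketch rather than part of Codex-LB; the character set allowed after the sk-clb- prefix is an assumption:

```python
import re

def looks_like_codex_lb_key(key: str) -> bool:
    # Keys must start with sk-clb- followed by the token body.
    # The allowed character set here is an assumption for illustration.
    return bool(re.fullmatch(r"sk-clb-[A-Za-z0-9_-]+", key))

print(looks_like_codex_lb_key("sk-clb-abc123"))   # True: well-formed key
print(looks_like_codex_lb_key("sk-proj-abc123"))  # False: wrong prefix (e.g. an OpenAI key)
```

A failed check points at a copy-paste error or a wrong-product key before any network call is made.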

OAuth Flow Failed

Error: “OAuth error: access_denied” or “Invalid OAuth callback state”
Common Causes:
  1. User denied authorization
    • User clicked “Deny” or closed the browser window
    • Session expired during authorization
  2. State token mismatch
    • CSRF protection detected a potential attack
    • Session cookie was cleared or expired
    • Multiple OAuth flows started simultaneously
  3. Callback port not available
    • Default callback port 1455 is already in use
    • Firewall blocking the callback server
Solution:
1. Restart OAuth Flow
Close the OAuth dialog and start a fresh flow.
2. Complete Authorization Quickly
Don’t leave the authorization page open for extended periods. Complete it within a few minutes.
3. Check Callback Port
If using browser flow, ensure port 1455 is available:
# Check if port is in use
lsof -i :1455

# If in use, stop the conflicting process or use device flow instead
4. Try Device Flow
If browser flow continues to fail, use the device code flow instead:
  1. Select “Device code” in the OAuth dialog
  2. Copy the user code
  3. Open the verification URL on any device
  4. Enter the code and complete authorization

Account Deactivated

Symptom: Account shows status deactivated in the dashboard
Common Causes:
  1. Refresh token expired
    • ChatGPT refresh tokens have a limited lifetime
    • Token was revoked on the ChatGPT side
  2. Account credentials revoked
    • Password changed on ChatGPT account
    • Account logged out of all sessions
    • 2FA settings changed
  3. Permanent authentication failure
    • Account deleted or suspended
    • Terms of service violation
Solution:
1. Remove Old Account
Delete the deactivated account from the dashboard.
2. Verify ChatGPT Account
Log in to ChatGPT directly to ensure:
  • Account is active and in good standing
  • No security alerts or verification required
  • Subscription is active (if applicable)
3. Re-add Account
Add the account again via OAuth with fresh credentials.
4. Monitor Status
Check the account status after a few requests to ensure it remains active.

Rate Limiting Issues

API Key Rate Limit Exceeded

Error:
{
  "error": {
    "code": "rate_limit_exceeded",
    "message": "API key total_tokens daily limit exceeded",
    "type": "rate_limit_error"
  }
}
HTTP Status: 429 Too Many Requests
Common Causes:
  1. Configured limit reached
    • API key has hit one of its rate limits
    • Check current_value vs max_value in the dashboard
  2. Underestimated usage
    • Actual usage higher than expected
    • Large prompts or long completions
    • Reasoning tokens for o1 models
Solution:
1. Check Current Usage
In the dashboard, view the API key’s limits:
{
  "limit_type": "total_tokens",
  "limit_window": "daily",
  "max_value": 1000000,
  "current_value": 1000000,
  "reset_at": "2026-03-04T00:00:00Z"
}
Note the reset_at time.
2. Wait for Reset
If you can wait, the limit will automatically reset at the reset_at time. Use the Retry-After header to know when to retry:
Retry-After: 43200
This indicates 43,200 seconds (12 hours) until reset.
3. Increase Limit
If you need immediate access:
  1. Edit the API key in the dashboard
  2. Increase the max_value for the limit
  3. Save changes
  4. Retry your request
4. Reset Usage (Emergency)
As a last resort, manually reset usage:
  1. Edit the API key
  2. Check “Reset usage”
  3. Save changes
This sets all current_value fields to 0 immediately.
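Both checks above can be mirrored client-side. This sketch assumes the limit object shape shown in the dashboard and that Retry-After is always whole seconds, as in the example above:

```python
def limit_exhausted(limit: dict) -> bool:
    # A limit blocks requests once usage reaches the configured maximum.
    return limit["current_value"] >= limit["max_value"]

def seconds_until_retry(headers: dict) -> int:
    # Codex-LB returns Retry-After as whole seconds until the limit resets.
    # (HTTP also allows an HTTP-date form, which this sketch does not handle.)
    return int(headers.get("Retry-After", "0"))

limit = {
    "limit_type": "total_tokens",
    "limit_window": "daily",
    "max_value": 1_000_000,
    "current_value": 1_000_000,
    "reset_at": "2026-03-04T00:00:00Z",
}
print(limit_exhausted(limit))                         # True: requests will get 429
print(seconds_until_retry({"Retry-After": "43200"}))  # 43200
```

Sleep for the returned number of seconds (time.sleep) before retrying, or surface the reset time to the caller.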

Account Rate Limited

Symptom: Account shows status rate_limited in the dashboard
Common Causes:
  1. ChatGPT rate limit hit
    • Account hit ChatGPT’s rate limits (requests per minute, tokens per day, etc.)
    • This is separate from Codex-LB API key limits
  2. High traffic spike
    • Sudden increase in request volume
    • Not enough accounts to handle load
Solution:
1. Check Reset Time
ChatGPT rate limits typically reset after 3-60 minutes. Check the account’s reset_at field if available.
2. Wait for Recovery
Codex-LB automatically routes requests to other accounts. The rate-limited account will recover automatically.
3. Add More Accounts
To prevent future rate limiting:
  1. Add more ChatGPT accounts to the load balancer
  2. Distribute traffic across more accounts
  3. Enable usage-weighted routing for better distribution

Quota Exceeded

Symptom: Account shows status quota_exceeded in the dashboard
Common Causes:
  1. ChatGPT quota exhausted
    • Free tier accounts have daily/weekly quotas
    • Plus/Team accounts have higher but still limited quotas
  2. All accounts exhausted simultaneously
    • Insufficient total quota for traffic volume
Solution:
1. Check Account Plan
Verify the account’s plan type:
  • Free: Very limited quota
  • Plus: Higher quota
  • Team/Enterprise: Highest quota
2. Wait for Reset
Quotas typically reset daily or weekly. The account will automatically recover.
3. Upgrade Accounts
Consider upgrading accounts to Plus or Team for higher quotas.
4. Add More Accounts
Add accounts with fresh quotas to increase total capacity.

Routing Issues

No Available Accounts

Error:
{
  "error": {
    "code": "no_accounts",
    "message": "No active accounts available",
    "type": "server_error"
  }
}
HTTP Status: 503 Service Unavailable
Common Causes:
  1. No accounts added
    • Load balancer has no accounts configured
  2. All accounts unavailable
    • All accounts are rate_limited, quota_exceeded, paused, or deactivated
  3. Token refresh failures
    • All accounts have failed token refresh and are deactivated
Solution:
1. Check Account Status
In the dashboard, review all accounts:
Account A: rate_limited
Account B: quota_exceeded
Account C: deactivated
If all accounts are unavailable, you’ll see this error.
2. Add New Accounts
If no accounts exist or all are deactivated:
  1. Click “Add Account” in the dashboard
  2. Complete OAuth flow
  3. Verify new account shows active status
3. Reactivate Paused Accounts
If accounts are paused:
  1. Select the paused account
  2. Click “Reactivate”
  3. Verify status changes to active
4. Wait for Recovery
If accounts are rate limited or quota exceeded, wait for them to reset automatically.
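The condition behind this error can be reproduced from the account list shown above. A minimal sketch; the account names are illustrative:

```python
statuses = {
    "Account A": "rate_limited",
    "Account B": "quota_exceeded",
    "Account C": "deactivated",
}

# Only accounts in the active state are eligible for routing;
# rate_limited, quota_exceeded, paused, and deactivated are all skipped.
available = [name for name, status in statuses.items() if status == "active"]
print(available)  # []: every request fails with no_accounts until one recovers
```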

Model Not Allowed

Error:
{
  "error": {
    "code": "model_not_allowed",
    "message": "Model 'gpt-4' is not allowed for this API key",
    "type": "invalid_request_error"
  }
}
HTTP Status: 403 Forbidden
Common Causes:
  1. Model not in allowed_models list
    • API key has allowed_models configured
    • Requested model is not in the list
  2. Case mismatch
    • Model names are case-sensitive
    • gpt-4 vs GPT-4 vs Gpt-4
Solution:
1. Check Allowed Models
In the dashboard, check the API key’s allowed_models:
{
  "allowed_models": ["gpt-3.5-turbo", "gpt-4o-mini"]
}
If gpt-4 is not in this list, it will be rejected.
2. Update Allowed Models
Add the requested model to the list:
  1. Edit the API key
  2. Add the model to allowed_models
  3. Ensure exact case match (e.g., gpt-4)
  4. Save changes
3. Allow All Models
Alternatively, remove the restriction entirely:
  1. Edit the API key
  2. Set allowed_models to empty or null
  3. Save changes

Sticky Sessions Not Working

Symptom: Requests with the same prompt_cache_key are routed to different accounts
Common Causes:
  1. Sticky threads disabled
    • Setting sticky_threads_enabled is false
  2. Missing prompt_cache_key
    • Request doesn’t include prompt_cache_key in body
  3. Account became unavailable
    • Sticky account hit rate limit or was deactivated
    • Session reallocated to different account
Solution:
1. Enable Sticky Threads
In Settings, enable “Sticky threads”:
{
  "sticky_threads_enabled": true
}
2. Include prompt_cache_key
Add prompt_cache_key to all related requests:
{
  "model": "gpt-4",
  "messages": [...],
  "prompt_cache_key": "conversation-123"
}
3. Monitor Account Status
If the sticky account becomes unavailable, Codex-LB automatically reallocates to another account. This is expected behavior.
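The request body in step 2 can be assembled programmatically so the cache key is never forgotten on a follow-up request. A minimal sketch; the helper name is illustrative:

```python
import json

def chat_payload(model: str, messages: list, cache_key: str) -> str:
    # Requests sharing the same prompt_cache_key stick to one account
    # while that account remains available.
    return json.dumps({
        "model": model,
        "messages": messages,
        "prompt_cache_key": cache_key,
    })

body = chat_payload("gpt-4", [{"role": "user", "content": "hi"}], "conversation-123")
print(body)
```

Reuse the same cache_key value for every request in a conversation to keep it pinned to one account.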

Request Issues

Upstream Error

Error:
{
  "error": {
    "code": "upstream_error",
    "message": "Upstream service error",
    "type": "server_error"
  }
}
HTTP Status: 502 Bad Gateway or 503 Service Unavailable
Common Causes:
  1. ChatGPT API downtime
    • Upstream ChatGPT API is experiencing issues
    • Check OpenAI Status
  2. Network connectivity
    • Codex-LB cannot reach ChatGPT API
    • Firewall or proxy blocking requests
  3. Invalid request payload
    • Malformed JSON or unsupported parameters
    • Check request body for errors
Solution:
1. Check OpenAI Status
Visit https://status.openai.com/ to see if there are ongoing incidents.
2. Retry Request
Upstream errors are often transient. Retry with exponential backoff:
import time

# client is an OpenAI client pointed at your Codex-LB instance.
# Retry up to 3 times, sleeping 1s then 2s between failed attempts.
for attempt in range(3):
    try:
        response = client.chat.completions.create(...)
        break
    except Exception:
        if attempt < 2:
            time.sleep(2 ** attempt)
        else:
            raise
3. Check Logs
Review Codex-LB logs for detailed error messages:
docker logs codex-lb
Look for connection errors, DNS failures, or timeout messages.
4. Verify Request Payload
Ensure your request is valid:
curl -X POST https://your-instance.com/v1/chat/completions \
  -H "Authorization: Bearer sk-clb-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "test"}]
  }'
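A malformed body can also be caught before it reaches Codex-LB: json.loads pinpoints the offending character, and a quick field check covers the required parameters. A sketch; the field list is illustrative:

```python
import json

raw = '{"model": "gpt-4", "messages": [{"role": "user", "content": "test"}]}'

payload = json.loads(raw)  # raises json.JSONDecodeError on malformed input
for field in ("model", "messages"):
    assert field in payload, f"missing required field: {field}"
print("payload OK")
```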

Timeout Errors

Symptom: Request takes a very long time or times out
Common Causes:
  1. Large prompts or completions
    • Very long input or output
    • o1 models with high reasoning effort
  2. Upstream ChatGPT slow
    • ChatGPT API is experiencing high latency
  3. Client timeout too short
    • Client has a timeout shorter than the request duration
Solution:
1. Increase Client Timeout
Set a higher timeout in your client:
from openai import OpenAI

client = OpenAI(
    base_url="https://your-instance.com/v1",
    api_key="sk-clb-your-key",
    timeout=120.0  # 120 seconds
)
2. Use Streaming
For long completions, use streaming to get partial results:
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "..."}],
    stream=True
)

for chunk in response:
    print(chunk.choices[0].delta.content or "", end="")
3. Reduce Prompt Size
If possible, reduce the input size:
  • Summarize or truncate long contexts
  • Remove unnecessary information
  • Use more concise prompts

Invalid Response Format

Symptom: Response doesn’t match expected OpenAI format
Common Causes:
  1. Error response
    • Request failed and returned an error object
    • Check for error field in response
  2. Streaming mode mismatch
    • Client expects streaming but stream: false
    • Client expects non-streaming but stream: true
Solution:
1. Check for Errors
Always check if the response is an error:
try:
    response = client.chat.completions.create(...)
except Exception as e:
    print(f"Error: {e}")
2. Match Streaming Mode
Ensure client and request agree on streaming:
# Non-streaming
response = client.chat.completions.create(
    model="gpt-4",
    messages=[...],
    stream=False  # or omit
)
print(response.choices[0].message.content)

# Streaming
response = client.chat.completions.create(
    model="gpt-4",
    messages=[...],
    stream=True
)
for chunk in response:
    print(chunk.choices[0].delta.content or "", end="")

Performance Issues

High Latency

Symptom: Requests take longer than expected
Common Causes:
  1. Upstream latency
    • ChatGPT API is slow
    • Check OpenAI status
  2. Account selection overhead
    • Many accounts to evaluate
    • Complex routing logic
  3. Database queries
    • Slow database reads for account/key lookup
Solution:
1. Measure Latency Components
Check the X-Request-ID header and review logs to see:
  • Time to select account
  • Time for upstream request
  • Time for response processing
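The upstream portion can be timed client-side and correlated with server logs via the X-Request-ID header. A sketch with an illustrative context manager:

```python
import time
from contextlib import contextmanager

@contextmanager
def stopwatch(timings: dict, label: str):
    # Records the wall-clock duration of the enclosed block, in seconds.
    start = time.monotonic()
    try:
        yield
    finally:
        timings[label] = time.monotonic() - start

timings: dict = {}
with stopwatch(timings, "upstream"):
    time.sleep(0.01)  # stand-in for client.chat.completions.create(...)
print(f"upstream took {timings['upstream']:.3f}s")
```

Comparing this client-side number against the server-side breakdown in the logs separates upstream ChatGPT latency from Codex-LB routing overhead.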
2. Optimize Routing
Use a simpler routing strategy:
  • Switch to round_robin if using usage_weighted
  • Disable “prefer earlier reset” if enabled
  • Disable sticky sessions if not needed
3. Use Fewer Accounts
Remove inactive or redundant accounts:
  • Delete deactivated accounts
  • Remove paused accounts not in use
  • Keep only necessary accounts

High Error Rate

Symptom: Many requests fail with 429, 503, or 502 errors
Common Causes:
  1. Insufficient capacity
    • Not enough accounts for request volume
    • Accounts hitting rate limits frequently
  2. Account issues
    • Accounts becoming deactivated
    • Token refresh failures
Solution:
1. Add More Accounts
Increase total capacity by adding more ChatGPT accounts.
2. Implement Client Retries
Add retry logic in your application:
from openai import OpenAI

client = OpenAI(
    base_url="https://your-instance.com/v1",
    api_key="sk-clb-your-key",
    max_retries=3
)
3. Monitor Account Health
Regularly check account status in the dashboard and reactivate or replace problematic accounts.

Dashboard Issues

Cannot Access Dashboard

Symptom: Dashboard login page doesn’t load or returns an error
Common Causes:
  1. Codex-LB not running
    • Service is stopped or crashed
  2. Wrong URL or port
    • Dashboard runs on a different port (default: 3000)
  3. Firewall blocking access
    • Network firewall blocking dashboard port
Solution:
1. Verify Service Running
Check if Codex-LB is running:
docker ps | grep codex-lb
# or
systemctl status codex-lb
2. Check Port
Verify the correct port:
# Default: http://localhost:3000
curl http://localhost:3000/health
3. Check Logs
Review logs for startup errors:
docker logs codex-lb

Login Failed

Symptom: Cannot log in to dashboard
Common Causes:
  1. Wrong password
    • Incorrect admin password
  2. TOTP enabled
    • 2FA required but not provided
  3. Session expired
    • Previous session timed out
Solution:
1. Verify Password
Double-check your admin password. If you’ve forgotten it, reset via environment variable or config file.
2. Enter TOTP Code
If 2FA is enabled, enter the 6-digit code from your authenticator app.
3. Clear Browser Cache
Clear cookies and try again:
  1. Open browser developer tools
  2. Go to Application → Cookies
  3. Delete cookies for the dashboard domain
  4. Refresh and log in again

Getting Help

If you’re still experiencing issues:
  1. Check Logs: Review Codex-LB logs for detailed error messages
    docker logs codex-lb --tail=100 --follow
    
  2. Enable Debug Logging: Set LOG_LEVEL=DEBUG in the container environment for more detail. Note that prefixing the variable to docker restart only sets it for the Docker CLI, not the container, so define it where the container is created, e.g.:
    docker run -e LOG_LEVEL=DEBUG ... codex-lb
    
  3. Review Documentation: Check the other guides in this documentation for related information
  4. Contact Support: Reach out with:
    • Error messages (sanitized)
    • Steps to reproduce
    • Relevant log excerpts
    • Configuration details (without secrets)

Next Steps

API Reference

Explore the complete API documentation

Development

Set up a development environment
