Account Rotation

Overview

Account rotation is the core mechanism that selects which account to use for each request. The system uses a hybrid selection algorithm that combines:

Health scores - Track success/failure patterns
Token buckets - Throttle requests client-side before server rate limits are hit
Freshness - Distribute load across accounts
Session affinity - Maintain consistency within conversations

Selection Algorithm

Hybrid Scoring Formula

The account selection uses a weighted scoring system (lib/rotation.ts:318-393):

Score = (health × 2) + (tokens × 5) + (hoursSinceUsed × 2)

Weight breakdown:

Health weight: 2 - Moderate preference for healthy accounts
Token weight: 5 - Strong preference for accounts with quota headroom
Freshness weight: 2 - Balanced load distribution
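
To make the weighting concrete, here is a minimal worked example of the formula above. The `ScoreInputs` shape and the two sample accounts are hypothetical and chosen only for illustration; the weights are the defaults listed above.

```typescript
// Hypothetical input shape; these field names are assumptions for illustration.
interface ScoreInputs {
  health: number;         // 0-100
  tokens: number;         // tokens left in the client-side bucket
  hoursSinceUsed: number; // hours since the account last served a request
}

// Default weights from the formula above.
const WEIGHTS = { health: 2, tokens: 5, freshness: 2 };

function hybridScore(a: ScoreInputs): number {
  return (
    a.health * WEIGHTS.health +
    a.tokens * WEIGHTS.tokens +
    a.hoursSinceUsed * WEIGHTS.freshness
  );
}

// A perfectly healthy account with an almost empty bucket...
const healthyButDrained: ScoreInputs = { health: 100, tokens: 5, hoursSinceUsed: 0 };
// ...versus a degraded account with a full bucket.
const degradedButFull: ScoreInputs = { health: 60, tokens: 50, hoursSinceUsed: 1 };

const scoreA = hybridScore(healthyButDrained); // 100*2 + 5*5 + 0*2 = 225
const scoreB = hybridScore(degradedButFull);   // 60*2 + 50*5 + 1*2 = 372
console.log({ scoreA, scoreB });
```

Because the token weight dominates, the account with quota headroom wins here even though its health score is lower.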

Selection Flow

The flow, as reflected in the implementation below:

Filter - Keep only the accounts currently marked available
Fallback - If none are available, return the least recently used account
Score - Compute the weighted score for each available account
Select - Return the account with the highest score

Implementation

From lib/rotation.ts:318-393:

export function selectHybridAccount(
  accounts: AccountWithMetrics[],
  healthTracker: HealthScoreTracker,
  tokenTracker: TokenBucketTracker,
  quotaKey?: string,
  config: Partial<HybridSelectionConfig> = {},
  options: HybridSelectionOptions = {}
): AccountWithMetrics | null {
  const cfg = { 
    healthWeight: 2,
    tokenWeight: 5,
    freshnessWeight: 2.0,
    ...config 
  };
  
  const available = accounts.filter(a => a.isAvailable);
  
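  // Fallback: return least recently used if none available
  if (available.length === 0) {
    // reduce() without an initial value throws on an empty array
    if (accounts.length === 0) return null;
    return accounts.reduce((oldest, curr) => 
      curr.lastUsed < oldest.lastUsed ? curr : oldest
    );
  }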
  
  let bestAccount: AccountWithMetrics | null = null;
  let bestScore = -Infinity;
  
  for (const account of available) {
    const health = healthTracker.getScore(account.index, quotaKey);
    const tokens = tokenTracker.getTokens(account.index, quotaKey);
    const hoursSinceUsed = (Date.now() - account.lastUsed) / (1000 * 60 * 60);
    
    const score = 
      health * cfg.healthWeight +
      tokens * cfg.tokenWeight +
      hoursSinceUsed * cfg.freshnessWeight;
    
    if (score > bestScore) {
      bestScore = score;
      bestAccount = account;
    }
  }
  
  return bestAccount;
}

Health Score Tracking

Score Dynamics

Health scores range from 0 to 100 and recover passively while an account is idle (lib/rotation.ts:17-127):

interface HealthScoreConfig {
  successDelta: number;           // +1 per success
  rateLimitDelta: number;         // -10 per rate limit
  failureDelta: number;           // -20 per failure
  maxScore: number;               // 100
  minScore: number;               // 0
  passiveRecoveryPerHour: number; // +2 per hour idle
}

Passive Recovery

Unused accounts gradually recover health:

private applyPassiveRecovery(entry: HealthEntry): number {
  const now = Date.now();
  const hoursSinceUpdate = (now - entry.lastUpdated) / (1000 * 60 * 60);
  const recovery = hoursSinceUpdate * this.config.passiveRecoveryPerHour;
  return Math.min(entry.score + recovery, this.config.maxScore);
}

Example recovery timeline:

Account hits rate limit: health drops to 60 (-10)
After 1 hour idle: health recovers to 62 (+2)
After 10 hours idle: health recovers to 80 (+20)
After 20 hours idle: health fully recovers to 100 and stays capped there
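
The recovery arithmetic can be checked with a standalone sketch. `recoveredHealth` is a hypothetical helper that mirrors the `Math.min` cap in `applyPassiveRecovery`, assuming the default of 2 points per hour and a maximum of 100.

```typescript
// Hypothetical standalone helper; mirrors the capped linear recovery shown above.
function recoveredHealth(
  score: number,
  hoursIdle: number,
  passiveRecoveryPerHour = 2,
  maxScore = 100
): number {
  return Math.min(score + hoursIdle * passiveRecoveryPerHour, maxScore);
}

const afterOneHour = recoveredHealth(60, 1);      // 60 + 2 = 62
const afterTwentyHours = recoveredHealth(60, 20); // 60 + 40 = 100
const afterThirtyHours = recoveredHealth(60, 30); // 60 + 60 = 120, capped at 100
console.log({ afterOneHour, afterTwentyHours, afterThirtyHours });
```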

Token Bucket Rate Limiting

Client-Side Token Tracking

A client-side token bucket holds requests back before server-side rate limits are hit (lib/rotation.ts:131-261):

interface TokenBucketConfig {
  maxTokens: number;        // 50 (max bucket capacity)
  tokensPerMinute: number;  // 6 (refill rate)
}

Token Consumption

Each request consumes one token:

tryConsume(accountIndex: number, quotaKey?: string): boolean {
  // `key`, `entry`, and `consumptions` come from a bucket lookup not shown in this excerpt
  const currentTokens = this.refillTokens(entry);
  
  if (currentTokens < 1) {
    return false; // Bucket empty, account unavailable
  }
  
  this.buckets.set(key, {
    tokens: currentTokens - 1,
    lastRefill: Date.now(),
    consumptions: [...consumptions, Date.now()]
  });
  
  return true;
}

Token Refills

Tokens automatically refill over time:

private refillTokens(entry: TokenBucketEntry): number {
  const now = Date.now();
  const minutesSinceRefill = (now - entry.lastRefill) / (1000 * 60);
  const tokensToAdd = minutesSinceRefill * this.config.tokensPerMinute;
  return Math.min(entry.tokens + tokensToAdd, this.config.maxTokens);
}

Example timeline:

Account starts: 50 tokens
After 10 requests: 40 tokens remaining
After 1 minute: 46 tokens (40 + 6 refilled)
After 10 minutes: 50 tokens (capped at max)
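
The timeline above follows directly from the refill formula. The sketch below restates it as two hypothetical helpers, `refilledTokens` and `minutesUntilFull`, assuming the default bucket of 50 tokens refilled at 6 per minute.

```typescript
// Hypothetical helpers; defaults assume maxTokens = 50 and tokensPerMinute = 6.
function refilledTokens(
  tokens: number,
  minutesElapsed: number,
  tokensPerMinute = 6,
  maxTokens = 50
): number {
  return Math.min(tokens + minutesElapsed * tokensPerMinute, maxTokens);
}

// How long an idle account needs before its bucket is full again.
function minutesUntilFull(
  tokens: number,
  tokensPerMinute = 6,
  maxTokens = 50
): number {
  return Math.max(0, (maxTokens - tokens) / tokensPerMinute);
}

const afterOneMinute = refilledTokens(40, 1);   // 40 + 6 = 46
const afterTenMinutes = refilledTokens(40, 10); // 40 + 60 = 100, capped at 50
const waitForFull = minutesUntilFull(40);       // 10 / 6 minutes, under two minutes
console.log({ afterOneMinute, afterTenMinutes, waitForFull });
```

With these defaults a bucket that is 10 tokens short is full again in under two minutes, well before the 10-minute mark in the timeline.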

Token Refunds

When a request fails with a network error (not a rate limit), the token it consumed can be refunded, provided it was consumed within the last 30 seconds:

refundToken(accountIndex: number, quotaKey?: string): boolean {
  // `entry` and `maxTokens` come from the bucket lookup and config, not shown in this excerpt
  const now = Date.now();
  const cutoff = now - 30_000; // 30 second window
  
  const validIndex = entry.consumptions.findIndex(
    timestamp => timestamp >= cutoff
  );
  
  if (validIndex === -1) return false;
  
  // Refund the token
  entry.consumptions.splice(validIndex, 1);
  entry.tokens = Math.min(entry.tokens + 1, maxTokens);
  return true;
}
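
To see the 30-second window in action, here is a self-contained sketch of the same refund logic. The `BucketEntry` shape and the explicit `now` parameter are assumptions made so the example is deterministic; the excerpt above reads the clock itself.

```typescript
// Assumed minimal entry shape; the real TokenBucketEntry may carry more fields.
interface BucketEntry {
  tokens: number;
  consumptions: number[]; // timestamps (ms) of recent token consumptions
}

const REFUND_WINDOW_MS = 30_000;
const MAX_TOKENS = 50;

// Same steps as the excerpt above, with the clock passed in explicitly.
function refund(entry: BucketEntry, now: number): boolean {
  const cutoff = now - REFUND_WINDOW_MS;
  const idx = entry.consumptions.findIndex((timestamp) => timestamp >= cutoff);
  if (idx === -1) return false; // nothing consumed recently enough

  entry.consumptions.splice(idx, 1);
  entry.tokens = Math.min(entry.tokens + 1, MAX_TOKENS);
  return true;
}

// One stale consumption (t = 1 s) and one recent consumption (t = 50 s).
const entry: BucketEntry = { tokens: 40, consumptions: [1_000, 50_000] };

const firstRefund = refund(entry, 60_000);  // true: 50_000 is inside the window
const secondRefund = refund(entry, 60_000); // false: only the stale entry is left
console.log({ firstRefund, secondRefund, tokens: entry.tokens });
```

Each consumption can be refunded at most once, because the matching timestamp is removed from `consumptions` when the token is returned.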

Failover Mechanisms

Automatic Rotation Triggers

Accounts rotate automatically when:

Rate Limit (429)

Action:

Mark account rate-limited with reset time
Drain token bucket (-10 tokens)
Reduce health score (-10 points)
Rotate to next available account

Recovery:

Account becomes available after Retry-After expires
Tokens refill at 6/minute
Health recovers at 2/hour

Auth Failure (401/403)

Action:

Attempt token refresh
If refresh fails 3+ times: mark account for cooldown
Rotate to next account

Recovery:

Manual re-authentication: codex auth login
Auto-retry after cooldown period

Server Error (5xx)

Action:

Reduce health score (-20 points)
Rotate to next account
Do NOT refund token (server-side failure)

Recovery:

Health recovers passively (2/hour)
Automatic retry after health improves

Network Error

Action:

Refund token if within 30s window
Reduce health score (-20 points)
Rotate to next account

Recovery:

Immediate retry with different account
Token refunded, no quota impact
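
The four triggers above can be summarized as a lookup table. This is a sketch, not the project's code: the `Trigger` type, `classify`, and `PLANS` are hypothetical names, a `null` status stands in for "no HTTP response received", and the health delta for auth failures is set to 0 only because the text above does not state one.

```typescript
type Trigger = "rate_limit" | "auth_failure" | "server_error" | "network_error";

interface FailoverPlan {
  healthDelta: number;  // change applied to the health score
  refundToken: boolean; // whether the consumed token may be refunded
}

// Map an HTTP status (or null when the request never got a response) to a trigger.
function classify(status: number | null): Trigger | null {
  if (status === null) return "network_error";
  if (status === 429) return "rate_limit";
  if (status === 401 || status === 403) return "auth_failure";
  if (status >= 500 && status <= 599) return "server_error";
  return null; // not a rotation trigger
}

const PLANS: Record<Trigger, FailoverPlan> = {
  rate_limit: { healthDelta: -10, refundToken: false },
  auth_failure: { healthDelta: 0, refundToken: false }, // assumption: no delta documented
  server_error: { healthDelta: -20, refundToken: false },
  network_error: { healthDelta: -20, refundToken: true }, // refund only within the 30 s window
};

const trigger = classify(503);
const plan = trigger ? PLANS[trigger] : null;
console.log(trigger, plan);
```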

Cooldown System

Accounts can be temporarily disabled (lib/accounts.ts:565-583):

markAccountCoolingDown(
  account: ManagedAccount, 
  cooldownMs: number, 
  reason: CooldownReason
): void {
  account.coolingDownUntil = Date.now() + cooldownMs;
  account.cooldownReason = reason;
}

isAccountCoolingDown(account: ManagedAccount): boolean {
  if (!account.coolingDownUntil) return false;
  if (Date.now() >= account.coolingDownUntil) {
    this.clearAccountCooldown(account);
    return false;
  }
  return true;
}

Cooldown reasons:

auth_failure - Multiple authentication failures
quota_exhausted - Primary + secondary quota both exhausted
manual - Manually disabled via codex auth disable <index>

Availability Checks

Account Availability

An account is available when ALL conditions are met:

isAccountAvailableForFamily(
  index: number, 
  family: ModelFamily, 
  model?: string
): boolean {
  const account = this.getAccountByIndex(index);
  if (!account) return false;
  if (account.enabled === false) return false;
  
  clearExpiredRateLimits(account);
  
  return !isRateLimitedForFamily(account, family, model) && 
         !this.isAccountCoolingDown(account);
}

Checks performed:

Account exists
Account not manually disabled
No active rate limits for model family
Not in cooldown period

Rate Limit Expiry

Expired rate limits are automatically cleared:

export function clearExpiredRateLimits(account: ManagedAccount): void {
  const now = Date.now();
  for (const [key, resetAt] of Object.entries(account.rateLimitResetTimes)) {
    if (resetAt <= now) {
      delete account.rateLimitResetTimes[key];
    }
  }
}
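
A small usage sketch shows the effect of the expiry pass. The reduced `RateLimited` shape, the explicit `now` parameter, and the keys `"family-a"` and `"family-b"` are hypothetical; the real key format is not shown in this document.

```typescript
// Reduced stand-in for ManagedAccount; only the reset-time map is modeled.
interface RateLimited {
  rateLimitResetTimes: Record<string, number>;
}

// Same logic as above, with the clock passed in so the result is deterministic.
function clearExpired(account: RateLimited, now: number): void {
  for (const key of Object.keys(account.rateLimitResetTimes)) {
    if (account.rateLimitResetTimes[key] <= now) {
      delete account.rateLimitResetTimes[key];
    }
  }
}

const limited: RateLimited = {
  rateLimitResetTimes: { "family-a": 1_000, "family-b": 5_000 },
};

clearExpired(limited, 2_000);
const remainingKeys = Object.keys(limited.rateLimitResetTimes); // ["family-b"]
console.log(remainingKeys);
```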

Monitoring and Diagnostics

View Account Health

# Quick health check
codex auth check

# Detailed health with live quota probes
codex auth check --live

# Account forecast (next recommended account)
codex auth forecast

# Forecast with live quota data
codex auth forecast --live --model gpt-5-codex

Health Dashboard

Run codex auth to see:

┌─────────────────────────────────────────────────────────────────┐
│ Account 1 ([email protected])                           [ACTIVE]  │
│ Health: ████████░░ 85/100    Tokens: ██████████ 42/50          │
│ Last used: 2m ago            Rate limits: none                  │
└─────────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────┐
│ Account 2 ([email protected])                                 │
│ Health: ████░░░░░░ 40/100    Tokens: ████░░░░░░ 18/50          │
│ Last used: 15m ago           Rate limits: 2h 45% left (14:30)  │
└─────────────────────────────────────────────────────────────────┘

Configuration

Tuning Rotation Behavior

Customize selection weights in your Codex config:

{
  "multiAuth": {
    "rotation": {
      "healthWeight": 2.0,
      "tokenWeight": 5.0,
      "freshnessWeight": 2.0
    },
    "tokenBucket": {
      "maxTokens": 50,
      "tokensPerMinute": 6
    },
    "healthScore": {
      "successDelta": 1,
      "rateLimitDelta": -10,
      "failureDelta": -20,
      "passiveRecoveryPerHour": 2
    }
  }
}

Increase tokenWeight to more aggressively avoid accounts nearing rate limits. Increase freshnessWeight to distribute load more evenly.
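
As a rough illustration of how these weights shift the outcome, the sketch below scores two hypothetical accounts under the default weights and again with a higher `freshnessWeight`. `Candidate`, `pickBest`, and the sample values are assumptions, not part of the project's API.

```typescript
interface Candidate {
  name: string;
  health: number;
  tokens: number;
  hoursSinceUsed: number;
}

interface Weights {
  healthWeight: number;
  tokenWeight: number;
  freshnessWeight: number;
}

const DEFAULTS: Weights = { healthWeight: 2, tokenWeight: 5, freshnessWeight: 2 };

// Returns the name of the highest-scoring candidate, or null for an empty list.
function pickBest(candidates: Candidate[], overrides: Partial<Weights> = {}): string | null {
  const w = { ...DEFAULTS, ...overrides };
  let best: Candidate | null = null;
  let bestScore = -Infinity;
  for (const c of candidates) {
    const score =
      c.health * w.healthWeight +
      c.tokens * w.tokenWeight +
      c.hoursSinceUsed * w.freshnessWeight;
    if (score > bestScore) {
      bestScore = score;
      best = c;
    }
  }
  return best ? best.name : null;
}

const candidates: Candidate[] = [
  { name: "justUsed", health: 100, tokens: 50, hoursSinceUsed: 0 },
  { name: "rested", health: 100, tokens: 45, hoursSinceUsed: 10 },
];

const withDefaults = pickBest(candidates);                          // 450 vs 445
const withFreshness = pickBest(candidates, { freshnessWeight: 3 }); // 450 vs 455
console.log({ withDefaults, withFreshness });
```

With the defaults the fuller bucket wins; raising `freshnessWeight` from 2 to 3 is enough to move the choice to the account that has been idle for ten hours.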

Related Concepts

Quota Management - Learn how quotas are tracked and how rate limits are prevented
Session Recovery - Understand session affinity and recovery
Multi-Account OAuth - See how accounts are authenticated
Settings Reference - View all configuration options
