
Overview

The StellarStack API implements rate limiting using a token bucket algorithm to ensure fair usage and protect against abuse. Different endpoints have different rate limits based on their resource intensity and security requirements.

How Rate Limiting Works

Each request consumes one token from your bucket. Tokens are refilled over time based on the configured window. When your bucket is empty, requests are rejected with a 429 Too Many Requests status code.
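The token bucket described above can be sketched in a few lines of JavaScript. This is an illustrative model, not StellarStack's actual implementation; `capacity` and `refillRate` are assumed parameters.

```javascript
// Minimal token-bucket sketch: capacity = maximum burst size,
// refillRate = tokens added per second.
function createBucket(capacity, refillRate) {
  let tokens = capacity;
  let last = Date.now();

  return {
    // Returns true if a token was consumed, false if the
    // request should be rejected with 429.
    tryConsume() {
      const now = Date.now();
      // Refill proportionally to elapsed time, capped at capacity.
      tokens = Math.min(capacity, tokens + ((now - last) / 1000) * refillRate);
      last = now;
      if (tokens >= 1) {
        tokens -= 1;
        return true;
      }
      return false;
    },
  };
}
```

A bucket with capacity 100 and a refill rate of 100/60 tokens per second would model a "100 requests per minute" limit: short bursts are absorbed, and sustained traffic is held to the refill rate.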

Rate Limit Headers

All API responses include rate limit information:
X-RateLimit-Limit: 100
X-RateLimit-Remaining: 95
X-RateLimit-Reset: 1709308800
  • X-RateLimit-Limit (integer): Maximum requests allowed in the current window
  • X-RateLimit-Remaining (integer): Requests remaining in the current window
  • X-RateLimit-Reset (integer): Unix timestamp (seconds) when the rate limit resets
  • Retry-After (integer): Seconds to wait before retrying (only present when rate limited)

Rate Limit Tiers

Authentication Endpoints

Strict rate limiting for security-sensitive authentication operations:
  • /api/auth/sign-in/*: 5 requests per minute. Applies to login attempts to prevent brute-force attacks.
  • /api/auth/sign-up/*: 5 requests per minute. Limits account creation to prevent spam.
  • /api/auth/forget-password/*: 5 requests per minute. Protects password reset functionality.

Rate Limited Response

{
  "error": "Too many authentication attempts, please try again in a minute"
}

Sensitive Operations

For password resets and email verification:
  • Sensitive operations: 3 requests per 15-minute window (900,000 ms).
{
  "error": "Too many requests for this sensitive operation, please try again later"
}

General API Endpoints

Standard rate limit for most API operations:
  • /api/*: 100 requests per minute. Applies to all API endpoints unless a more specific limit is defined.
{
  "error": "API rate limit exceeded, please slow down"
}

Server Actions

Rate limiting for server control operations (start, stop, restart):
  • Server actions: 10 requests per minute per server. The limit is scoped to the combination of client IP address and server ID.
{
  "error": "Too many server actions, please wait before trying again"
}

File Operations

Limits for file upload, download, and management:
  • File operations: 30 requests per minute. Applies to file system operations on servers.
{
  "error": "Too many file operations, please slow down"
}

Webhook Testing

Rate limit for testing webhook endpoints:
  • Webhook tests: 5 requests per minute. Prevents abuse of the webhook test functionality.
{
  "error": "Too many webhook tests, please wait before testing again"
}

Rate Limit Scope

IP-Based Limiting

Most rate limits are applied per client IP address. The API detects your IP from:
  1. X-Forwarded-For header (first IP in list)
  2. X-Real-IP header
  3. CF-Connecting-IP header (Cloudflare)
  4. Direct connection IP (fallback)
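The precedence order above can be expressed as a small helper. This is a hypothetical sketch (`resolveClientIp` is not part of the API); `headers` is assumed to be a plain object keyed by lowercased header names.

```javascript
// Resolve the client IP using the documented precedence order.
function resolveClientIp(headers, socketIp) {
  if (headers['x-forwarded-for']) {
    // X-Forwarded-For can hold a comma-separated chain of proxies;
    // the first entry is the original client.
    return headers['x-forwarded-for'].split(',')[0].trim();
  }
  if (headers['x-real-ip']) return headers['x-real-ip'];
  if (headers['cf-connecting-ip']) return headers['cf-connecting-ip'];
  return socketIp; // direct connection fallback
}
```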

Combined Scoping

Server action rate limits use a combination of:
  • Client IP address
  • Server ID
This allows you to perform actions on multiple servers simultaneously while preventing abuse on individual servers.
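One way to picture the combined scope is a rate-limit key that concatenates both values. The sketch below is a hypothetical in-memory model (a real limiter would also reset its counters every minute):

```javascript
// Count server actions per (IP, server ID) pair. A fresh pair
// starts its own counter, so actions on other servers are unaffected.
const counts = new Map();

function allowServerAction(clientIp, serverId, limit = 10) {
  const key = `server-action:${clientIp}:${serverId}`;
  const n = (counts.get(key) ?? 0) + 1;
  counts.set(key, n);
  return n <= limit; // a real limiter would reset this count each minute
}
```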

Handling Rate Limits

Best Practices

  1. Monitor Headers - Check X-RateLimit-Remaining before making requests
  2. Respect Retry-After - Wait the specified time before retrying
  3. Implement Backoff - Use exponential backoff for retries
  4. Batch Operations - Combine multiple operations where possible
  5. Cache Responses - Store frequently accessed data locally

Example: Checking Rate Limits

const response = await fetch('http://localhost:3001/api/servers', {
  headers: {
    'Cookie': 'better-auth.session_token=<token>'
  }
});

const remaining = response.headers.get('X-RateLimit-Remaining');
const reset = response.headers.get('X-RateLimit-Reset');

console.log(`Requests remaining: ${remaining}`);
console.log(`Resets at: ${new Date(Number(reset) * 1000)}`); // header value is a string

if (response.status === 429) {
  const retryAfter = response.headers.get('Retry-After');
  console.log(`Rate limited. Retry after ${retryAfter} seconds`);
}

Example: Exponential Backoff

async function makeRequestWithBackoff(url, options, maxRetries = 3) {
  for (let i = 0; i < maxRetries; i++) {
    const response = await fetch(url, options);
    
    if (response.status !== 429) {
      return response;
    }
    
    const retryAfter = response.headers.get('Retry-After');
  const delay = retryAfter ? parseInt(retryAfter, 10) * 1000 : Math.pow(2, i) * 1000;
    
    console.log(`Rate limited. Waiting ${delay}ms before retry...`);
    await new Promise(resolve => setTimeout(resolve, delay));
  }
  
  throw new Error('Max retries exceeded');
}

Production Considerations

The current implementation uses an in-memory store for rate limiting. For distributed production deployments, consider using Redis or another distributed cache to share rate limit state across API instances.
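The shared-state idea can be sketched as a fixed-window limiter behind a minimal store interface. In production the store would be Redis (for example, INCR on a per-window key plus EXPIRE), so every API instance sees the same counts; here an in-memory Map stands in so the sketch is self-contained. All names are illustrative.

```javascript
// Fixed-window limiter over an abstract store exposing async incr(key).
function createLimiter(store, limit, windowMs) {
  return async function isAllowed(key) {
    const window = Math.floor(Date.now() / windowMs);
    // One counter per key per window; a Redis store would INCR this
    // key and EXPIRE it once the window has passed.
    const count = await store.incr(`${key}:${window}`);
    return count <= limit;
  };
}

// In-memory stand-in for the distributed store.
const memoryStore = {
  counts: new Map(),
  async incr(key) {
    const n = (this.counts.get(key) ?? 0) + 1;
    this.counts.set(key, n);
    return n;
  },
};
```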

Cleanup

Rate limit entries are automatically cleaned up:
  • Old entries (>1 hour) are removed every 5 minutes
  • Helps prevent memory leaks in long-running processes
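The sweep described above might look like the following (the entry shape is an assumption: each entry records the timestamp of its last request).

```javascript
const ONE_HOUR = 60 * 60 * 1000;

// Remove entries that have been idle for more than an hour.
function sweep(entries, now = Date.now()) {
  for (const [key, entry] of entries) {
    if (now - entry.last > ONE_HOUR) entries.delete(key);
  }
}

// In the server this would run on a timer:
// setInterval(() => sweep(rateLimitEntries), 5 * 60 * 1000);
```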

Bypassing Rate Limits

There is no built-in mechanism to bypass rate limits. All clients, including administrators, are subject to the same limits to ensure system stability. If you need higher limits for specific use cases, consider:
  • Modifying the rate limit configuration in the source code
  • Implementing custom rate limit tiers based on user roles
  • Using request batching to reduce total request count
