The Rate Limit responder doesn't block requests itself. Instead, it marks matching requests with a header that the caddy-ratelimit module can use to apply rate limits.

Overview

This responder is unique because it continues the handler chain rather than terminating the request. It sets an X-RateLimit-Apply header on requests from matching IP ranges, which caddy-ratelimit can then use to apply different rate limits to different client groups.

Requirements

This responder requires the caddy-ratelimit module to be installed and configured separately.
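Both plugins must be compiled into the Caddy binary. A build sketch using xcaddy follows; the module paths are assumptions, so confirm them against each project's README:

```shell
xcaddy build \
  --with github.com/mholt/caddy-ratelimit \
  --with github.com/jasonlovesdoggo/caddy-defender
```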

Configuration

ranges
string[]
IP ranges to mark for rate limiting. Can be CIDR notations or predefined service keys.
Default: ["aws", "azurepubliccloud", "deepseek", "gcloud", "githubcopilot", "openai"]
whitelist
string[]
Optional list of specific IP addresses to exclude from rate limiting.
Default: []
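A minimal sketch combining both options (the addresses are placeholders, and this assumes whitelist is accepted as a Caddyfile subdirective alongside ranges):

```caddyfile
defender ratelimit {
    ranges aws openai 203.0.113.0/24
    whitelist 203.0.113.10
}
```

Here 203.0.113.0/24 is marked for rate limiting via a raw CIDR, while the single address 203.0.113.10 inside that range is exempted.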

Behavior

When a request matches the configured ranges:
  • X-RateLimit-Apply - set to "true" on the matching request
  • Request flow - continues to the next handler in the chain (does not terminate)

Examples

{
    order rate_limit after basic_auth
}

:80 {
    defender ratelimit {
        ranges openai aws deepseek
    }
    
    rate_limit {
        zone ai_scrapers {
            match {
                header X-RateLimit-Apply true
            }
            key {remote_host}
            events 10
            window 1m
        }
    }
    
    respond "Content"
}

Implementation Details

The Rate Limit responder is implemented in responders/ratelimit.go:12:
func (r *RateLimitResponder) ServeHTTP(w http.ResponseWriter, req *http.Request, next caddyhttp.Handler) error {
    req.Header.Set("X-RateLimit-Apply", "true")
    
    // Continue with the handler chain
    return next.ServeHTTP(w, req)
}
Unlike other responders, this one:
  1. Modifies the request by adding a header
  2. Calls next.ServeHTTP() to continue the handler chain
  3. Does not terminate the request

How It Works

caddy-ratelimit Integration

The caddy-ratelimit module checks for the header:
rate_limit {
    zone marked_requests {
        match {
            header X-RateLimit-Apply true
        }
        key {remote_host}
        events 10
        window 1m
    }
}

Rate Limit Parameters

  • events - Number of requests allowed
  • window - Time window for the limit (e.g., 1m, 1h)
  • key - What to rate limit by (IP, path, header, etc.)

Use Cases

Tiered Rate Limiting

Apply different limits to different client groups:
defender ratelimit {
    ranges aws gcloud azure
}

rate_limit {
    # Strict limit for cloud providers
    zone cloud {
        match header X-RateLimit-Apply true
        key {remote_host}
        events 5
        window 1m
    }
    
    # Generous limit for others
    zone general {
        key {remote_host}
        events 100
        window 1m
    }
}

API Endpoint Protection

Rate limit API endpoints from AI scrapers:
defender ratelimit {
    ranges openai deepseek
}

rate_limit {
    zone api {
        match {
            path /api/*
            header X-RateLimit-Apply true
        }
        key {remote_host}
        events 2
        window 1m
    }
}

Cost Control

Limit expensive operations for suspected scrapers:
defender ratelimit {
    ranges scrapers bots
}

rate_limit {
    zone expensive {
        match {
            path /search*
            header X-RateLimit-Apply true
        }
        key {remote_host}
        events 1
        window 5m
    }
}

Advantages

  1. Flexible - Doesn’t block, just marks requests for rate limiting
  2. Targeted - Apply different limits to different IP ranges
  3. Graceful - Allows some access, just rate-limited
  4. Customizable - Full control over rate limit policies
  5. Non-blocking - Legitimate traffic not completely blocked

Comparison with Other Responders

  • vs Block: Rate Limit allows some requests, Block denies all
  • vs Drop: Rate Limit continues chain, Drop terminates
  • vs Tarpit: Rate Limit delegates enforcement to the caddy-ratelimit module, Tarpit slows responses directly
  • vs Custom: Rate Limit marks requests, Custom returns response

Best Practices

  1. Order matters - Place defender before rate_limit in handler order
  2. Set reasonable limits - Don’t make limits too strict
  3. Monitor logs - Check what’s being rate limited
  4. Use multiple zones - Different limits for different scenarios
  5. Consider whitelist - Protect known good IPs from limits

Handler Order

The defender directive must come before rate_limit in the handler chain.
{
    order defender after header
    order rate_limit after defender
}
or
{
    order rate_limit after basic_auth
}
# defender automatically comes before rate_limit

Testing

Test rate limiting:
# Make repeated requests from an IP in a configured range (simulated via X-Forwarded-For)
for i in {1..20}; do
    curl -H "X-Forwarded-For: 20.202.43.67" http://example.com
    sleep 0.1
done

# Should see 429 Too Many Requests after limit reached
Check if header is being set:
# Use a logging middleware to see headers
curl -v -H "X-Forwarded-For: 20.202.43.67" http://example.com

Troubleshooting

Rate limiting not working

  1. Check handler order - Ensure defender comes before rate_limit
  2. Verify header match - Confirm rate_limit is matching the header
  3. Check IP ranges - Verify the client IP is in configured ranges
  4. Review rate_limit config - Ensure caddy-ratelimit is properly configured

All requests being rate limited

  1. Check ranges - May be too broad (e.g., using all)
  2. Verify whitelist - Add known good IPs to whitelist
  3. Review rate_limit zones - May have multiple zones matching

Advanced Examples

Combined with Other Responders

example.com {
    # Block known bad actors completely
    defender block {
        ranges known-bad-ips
    }
    
    # Rate limit cloud providers
    defender ratelimit {
        ranges aws gcloud azure
    }
    
    rate_limit {
        zone cloud {
            match header X-RateLimit-Apply true
            key {remote_host}
            events 10
            window 1m
        }
    }
    
    respond "Content"
}

Custom Rate Limit Response

caddy-ratelimit zones do not define the response body themselves; the module returns an HTTP 429 error, which Caddy's handle_errors directive can turn into a custom response:
rate_limit {
    zone strict {
        match header X-RateLimit-Apply true
        key {remote_host}
        events 5
        window 1m
    }
}

handle_errors {
    @ratelimited expression {http.error.status_code} == 429
    respond @ratelimited "Too Many Requests - Try Again Later" 429
}
