
Overview

Redis Operator supports two cluster modes:
  • Standalone — A single primary with N-1 replicas, managed directly by the operator
  • Sentinel — Redis Sentinel provides high availability monitoring and automatic failover
A third mode, Cluster (Redis Cluster sharding), is reserved for future implementation and currently rejected by the webhook.
apiVersion: redis.io/v1
kind: RedisCluster
metadata:
  name: example
spec:
  mode: standalone  # or "sentinel"
  instances: 3
See api/v1/rediscluster_types.go:9-18 for the mode enum definition.

Standalone Mode

Default mode: spec.mode: standalone

Architecture

  • One primary, N-1 replicas: The operator selects a primary pod and configures all others as replicas
  • Operator-managed failover: The controller detects primary failure and promotes a replica
  • Client routing: Applications use the -leader Service for writes, -replica for reads

When to Use Standalone

  • Small to medium clusters (1-10 instances)
  • Predictable failover timing: You want full control over when and how failover happens
  • No external dependencies: No Sentinel instances to manage
  • Simple client configuration: Clients connect to <cluster>-leader:6379

Failover Behavior

See Failover and Fencing for the complete failover sequence. Summary:
  1. Controller detects primary unreachable (HTTP poll timeout)
  2. Fence the former primary
  3. Select replica with smallest replication lag
  4. Issue POST /v1/promote to the replica’s pod IP
  5. Update -leader Service selector to the new primary
  6. Update status.currentPrimary
  7. Clear fence; former primary restarts as replica
Failover time: Typically 10-30 seconds (depends on --status-poll-interval and HTTP timeout settings).
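The sequence above can be condensed into a sketch. This is illustrative only: the real controller is written in Go, and names like `replica_lag_bytes`, `promote`, and the `Replica`/`Cluster` types are hypothetical stand-ins.

```python
from dataclasses import dataclass, field

@dataclass
class Replica:
    name: str
    pod_ip: str
    replica_lag_bytes: int      # hypothetical stand-in for status.replicaLagBytes

@dataclass
class Cluster:
    name: str
    current_primary: str
    fenced: set = field(default_factory=set)

def failover(cluster: Cluster, replicas: list, promote) -> Replica:
    """Sketch of steps 2-7 of the operator-managed failover sequence."""
    former = cluster.current_primary
    cluster.fenced.add(former)                                    # 2. fence former primary
    candidate = min(replicas, key=lambda r: r.replica_lag_bytes)  # 3. smallest lag wins
    promote(candidate.pod_ip)                                     # 4. POST /v1/promote to pod IP
    # 5. the -leader Service selector would be patched to target the candidate here
    cluster.current_primary = candidate.name                      # 6. update status.currentPrimary
    cluster.fenced.discard(former)                                # 7. clear fence; it rejoins as replica
    return candidate
```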

Configuration Example

apiVersion: redis.io/v1
kind: RedisCluster
metadata:
  name: standalone-example
spec:
  mode: standalone
  instances: 3
  storage:
    size: 10Gi
    storageClassName: fast-ssd
  resources:
    requests:
      memory: 2Gi
      cpu: 1000m
    limits:
      memory: 2Gi
  minSyncReplicas: 1  # Require at least 1 replica to acknowledge writes
minSyncReplicas and maxSyncReplicas control Redis’s min-replicas-to-write and min-replicas-max-lag settings for synchronous replication.
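A minimal sketch of that mapping, assuming `spec.minSyncReplicas` feeds `min-replicas-to-write` directly. The 10-second lag bound is Redis's own default for `min-replicas-max-lag`, used here only as an illustrative second argument; the helper name is hypothetical.

```python
def sync_replication_directives(min_sync_replicas: int, max_lag_seconds: int = 10):
    """Render the redis.conf directives behind the spec's sync-replication fields."""
    return [
        f"min-replicas-to-write {min_sync_replicas}",  # writes need this many connected replicas
        f"min-replicas-max-lag {max_lag_seconds}",     # replicas lagging longer than this don't count
    ]
```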

Service Endpoints

| Service | Selector | Purpose |
|---|---|---|
| standalone-example-leader | redis.io/role: primary | Write traffic |
| standalone-example-replica | redis.io/role: replica | Read-only traffic |
| standalone-example-any | redis.io/cluster: standalone-example | Any instance |

Primary Selection

On initial cluster creation:
  1. The operator creates Pod-0 first with no REPLICAOF directive (primary)
  2. Subsequent pods start with REPLICAOF <pod-0-ip> 6379
  3. status.currentPrimary is set to standalone-example-0
  4. The -leader Service selector is updated to route to Pod-0
During failover, the operator selects the replica with the smallest replicaLagBytes (closest to the former primary’s replication offset). See internal/controller/cluster/pods.go.
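The initial topology from steps 1-2 can be sketched as follows; `bootstrap_commands` is a hypothetical helper, and the real logic lives in internal/controller/cluster/pods.go.

```python
def bootstrap_commands(pod_ips, cluster="standalone-example", port=6379):
    """Map each pod name to its startup replication directive ('' = primary)."""
    primary_ip = pod_ips[0]  # Pod-0 starts with no REPLICAOF directive and becomes primary
    return {
        f"{cluster}-{i}": "" if i == 0 else f"REPLICAOF {primary_ip} {port}"
        for i in range(len(pod_ips))
    }
```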

Sentinel Mode

High-availability mode: spec.mode: sentinel

Architecture

  • Sentinel instances: The operator creates a separate StatefulSet of Redis Sentinel processes
  • Sentinel-managed failover: Sentinels monitor the primary and automatically promote a replica on failure
  • Operator orchestration: The controller configures Sentinels, updates Services, and handles Sentinel failures
  • Client routing: Applications connect to Sentinel for primary discovery

When to Use Sentinel

  • Large clusters (10+ instances)
  • Automatic failover without operator dependency: Sentinel can fail over even if the operator controller is down
  • Standard Redis HA pattern: Your team already uses Sentinel in production
  • Client library support: Your client library has built-in Sentinel support

Sentinel Configuration

Defaults (api/v1/rediscluster_types.go:82-87):
  • Sentinel instances: 3
  • Sentinel port: 26379
  • Quorum: 2 (majority of 3)
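Assuming the usual strict-majority formula (which yields the documented default of 2 for 3 Sentinels), the default quorum can be computed as:

```python
def default_quorum(sentinel_instances: int) -> int:
    # Strict majority of the Sentinel count (assumed formula,
    # consistent with the documented default of 2 out of 3).
    return sentinel_instances // 2 + 1
```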
Example:
apiVersion: redis.io/v1
kind: RedisCluster
metadata:
  name: sentinel-example
spec:
  mode: sentinel
  instances: 5  # Data pods: 1 primary + 4 replicas
  storage:
    size: 20Gi
This creates:
  • 5 data pods: sentinel-example-0 through sentinel-example-4
  • 3 Sentinel pods: sentinel-example-sentinel-0 through sentinel-example-sentinel-2
  • Sentinel Service: sentinel-example-sentinel (26379)

Sentinel Failover Flow

  1. Sentinels monitor the primary via PING health checks
  2. When quorum Sentinels agree the primary is down (default: 2 of 3), they elect a new primary
  3. Sentinels run REPLICAOF NO ONE on the selected replica
  4. Sentinels reconfigure other replicas to follow the new primary
  5. The operator detects the new primary via Sentinel status polling
  6. The operator updates status.currentPrimary and the -leader Service selector
Sentinel failover is typically faster than operator-managed failover (5-15 seconds) because Sentinels monitor the primary continuously.
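Steps 5-6 can be sketched like this. `query` stands in for issuing `SENTINEL get-master-addr-by-name mymaster` against the Sentinel Service (the master name `mymaster` matches the client example below); `ClusterState` and `reconcile_primary` are hypothetical names.

```python
class ClusterState:
    """Minimal stand-in for the operator's view of the cluster."""
    def __init__(self, current_primary, pod_by_ip):
        self.current_primary = current_primary
        self.pod_by_ip = pod_by_ip

def reconcile_primary(cluster, query, master_name="mymaster"):
    ip, _port = query(master_name)          # ask Sentinel who the primary is now
    pod = cluster.pod_by_ip.get(ip)
    if pod is not None and pod != cluster.current_primary:
        cluster.current_primary = pod       # step 6: update status.currentPrimary
        # ...the -leader Service selector would be patched here as well
        return True                         # a Sentinel-driven failover was observed
    return False                            # no change
```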

Client Configuration

Sentinel-aware clients (e.g., redis-py, Jedis, node-redis with Sentinel support):
from redis.sentinel import Sentinel

sentinel = Sentinel([
    ('sentinel-example-sentinel', 26379)
])
primary = sentinel.master_for('mymaster', socket_timeout=0.1)
replica = sentinel.slave_for('mymaster', socket_timeout=0.1)

primary.set('key', 'value')  # Write to primary
value = replica.get('key')    # Read from replica
Standard clients (use Kubernetes Services):
  • sentinel-example-leader:6379 — Write traffic (operator updates selector on failover)
  • sentinel-example-replica:6379 — Read-only traffic

Operator vs. Sentinel Responsibilities

| Responsibility | Standalone Mode | Sentinel Mode |
|---|---|---|
| Primary health monitoring | Operator (HTTP poll) | Sentinel (PING) |
| Replica selection | Operator (smallest lag) | Sentinel (configured priority) |
| Promotion command | Operator (POST /v1/promote) | Sentinel (REPLICAOF NO ONE) |
| Service selector update | Operator | Operator |
| status.currentPrimary update | Operator | Operator |
| Replica reconfiguration | Operator | Sentinel |
Even in Sentinel mode, the operator remains authoritative for Kubernetes resources (Services, Pods, PVCs). Sentinel handles Redis-level failover; the operator synchronizes Kubernetes state.

Sentinel Pod Lifecycle

Sentinel pods are managed by a StatefulSet (data pods are not). The operator:
  • Creates the Sentinel StatefulSet with 3 replicas (default)
  • Configures sentinel.conf via ConfigMap
  • Monitors Sentinel health via sentinel.status.sentinelReadyInstances
  • Updates Sentinel configuration when the primary changes
See internal/controller/cluster/sentinel.go and internal/instance-manager/run/sentinel.go.

Cluster Mode (Reserved)

Status: Not implemented. The webhook rejects spec.mode: cluster. Planned features:
  • Redis Cluster sharding (16384 hash slots)
  • Horizontal scalability across multiple primaries
  • Automatic data distribution and rebalancing
  • Client-side cluster protocol support
spec.mode: cluster is reserved and intentionally not implemented. Do not attempt to use it. The webhook will reject the resource with an error.
See AGENTS.md:25.

Mode Comparison

| Feature | Standalone | Sentinel | Cluster (Reserved) |
|---|---|---|---|
| Primary pods | 1 | 1 | N (multiple primaries) |
| Replica pods | N-1 | N-1 | M per primary |
| Sentinel pods | 0 | 3 (default) | 0 |
| Failover control | Operator | Sentinel + Operator | Redis Cluster |
| Failover time | 10-30s | 5-15s | 5s |
| Scalability | Vertical (bigger pods) | Vertical | Horizontal (more primaries) |
| Client complexity | Low (single endpoint) | Medium (Sentinel protocol) | High (cluster protocol) |
| Operator dependency | High (no operator = no failover) | Low (Sentinel can fail over) | Low |
| Use case | Small/medium clusters | Large HA clusters | Multi-tenant, high throughput |

Switching Modes

Changing spec.mode on an existing cluster is not supported. You must create a new cluster and migrate data.
Migration path:
  1. Create a new RedisCluster with the desired mode
  2. Use RedisBackup to back up the old cluster
  3. Restore the backup into the new cluster via spec.bootstrap.backupName
  4. Update application endpoints to the new cluster’s Services
  5. Delete the old cluster
Alternatively, use Replica Mode to set up external replication from the old cluster to the new cluster, then promote the new cluster.
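Steps 1 and 3 might be combined into a single manifest like the following, assuming spec.bootstrap.backupName behaves as described above (the cluster and backup names are illustrative):

```yaml
apiVersion: redis.io/v1
kind: RedisCluster
metadata:
  name: new-sentinel
spec:
  mode: sentinel        # the desired target mode
  instances: 5
  bootstrap:
    backupName: old-cluster-backup   # RedisBackup taken in step 2
```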
