Estimated time: 10-15 minutes Use this when a
RedisCluster is not converging and reconcile loops are not making visible progress.
Symptoms
status.phaseremains unchanged (for exampleDegraded) for multiple reconcile intervals.- Expected actions (pod recreate, service selector update, PVC creation) are not happening.
- Cluster events show repeated failures or no recent reconcile activity.
Prerequisites
kubectlaccess to the cluster and operator namespace.- Shell variables:
Diagnosis
Recovery Steps
Verification
- Reconcile activity resumes in logs/events.
- Status fields begin changing again.
- Cluster trends back to
Healthy.
Escalation
- If reconcile restarts but repeatedly fails with the same error, escalate with:
RedisClusterYAML- recent events
- operator logs around the failure
- If leader election lease is not being renewed, investigate controller-manager health and RBAC before retrying.