Overview
This guide provides systematic troubleshooting approaches for common vCluster issues. Use the diagnostic commands and solutions to quickly identify and resolve problems.General Troubleshooting Approach
Identify the Issue
Clearly define what’s not working:
- What were you trying to do?
- What happened instead?
- When did it start?
- Has it ever worked?
Review Changes
What changed recently?
- Configuration updates
- Version upgrades
- Infrastructure changes
- New deployments
Isolate the Problem
Test components individually:
- Host cluster connectivity
- Virtual cluster API server
- Resource syncing
- Network policies
Common Issues and Solutions
Connection and Access Issues
Cannot Connect to vCluster
Cannot Connect to vCluster
Symptoms:Solutions:
vcluster connecthangs or times out- Connection refused errors
- Authentication failures
-
Restart vCluster pods:
-
Check service account permissions:
-
Verify network policies:
-
Check for resource constraints:
-
Try reconnecting with verbose output:
kubectl Commands Timing Out
kubectl Commands Timing Out
Symptoms:Solutions:
- Commands hang indefinitely
- Timeouts after several seconds
- Intermittent connectivity
-
Increase timeout:
-
Check etcd performance:
-
Reduce cluster load:
- Scale down non-critical workloads
- Check for resource-intensive operations
- Review API server logs for high-frequency requests
-
Increase API server resources:
Authentication/Authorization Errors
Authentication/Authorization Errors
Resource Syncing Issues
Resources Not Syncing to Host
Resources Not Syncing to Host
Symptoms:Solutions:
- Resources created in vCluster don’t appear in host namespace
- Pods stay pending indefinitely
- Services not accessible from host
-
Verify sync configuration:
-
Restart syncer:
-
Check service account permissions:
-
Review resource quotas:
-
Check for naming conflicts:
Pods Stuck in Pending
Pods Stuck in Pending
Symptoms:Solutions:
- Pods created but never start
- Status remains “Pending”
- Containers not running
-
Insufficient resources:
-
PVC not bound:
-
Image pull failures:
-
Node selectors/taints:
Services Not Accessible
Services Not Accessible
Symptoms:Solutions:
- Services created but not reachable
- Connection refused or timeout
- DNS resolution failures
-
Verify service sync:
-
Check endpoints:
-
Test connectivity:
-
Review network policies:
Performance Issues
High CPU/Memory Usage
High CPU/Memory Usage
Symptoms:Solutions:
- Slow response times
- OOMKilled pods
- Throttling warnings
-
Increase resource limits:
-
Enable resource limiting in virtual cluster:
-
Optimize workload placement:
-
Profile and optimize:
Slow API Response Times
Slow API Response Times
Symptoms:Solutions:
- kubectl commands take long to complete
- API timeouts
- Unresponsive control plane
- Increase API server resources (see above)
-
Optimize etcd:
-
Reduce API load:
-
Enable API priority and fairness:
Stability Issues
Frequent Pod Restarts
Frequent Pod Restarts
Symptoms:Solutions:
- Pods restarting repeatedly
- CrashLoopBackOff status
- High restart count
-
OOM kills - increase memory:
-
Liveness probe too aggressive:
-
Application errors - check logs:
-
Resource contention:
Data Loss or Corruption
Data Loss or Corruption
Symptoms:Solutions:
- Resources disappearing
- Configuration resets
- State inconsistencies
-
Restore from backup:
-
Fix PVC issues:
-
Enable persistent storage:
Advanced Debugging
Enable Debug Logging
Interactive Debugging Shell
Network Debugging
Collect Comprehensive Debug Info
Getting Help
If you’re still experiencing issues:GitHub Issues
Search or create an issue: github.com/loft-sh/vcluster/issues
Slack Community
Join the community: vcluster.com/slack
Documentation
Browse docs: vcluster.com/docs
Support
Enterprise support: Contact your account team
When Reporting Issues
Include:-
Environment details:
- vCluster version
- Kubernetes version (host and virtual)
- Cloud provider/platform
- Installation method (Helm, Platform, etc.)
-
Reproduction steps:
- What you did
- What you expected
- What actually happened
-
Debug information:
- Output of
vcluster debug collect - Relevant logs
- Configuration files (sanitized)
- Error messages
- Output of
-
Attempted solutions:
- What you’ve tried
- Results of each attempt
Next Steps
Monitoring
Set up monitoring to catch issues early
Managing vClusters
Return to general management operations