Diagnostic Approach
Query Performance Issues
High Query Latency
Symptoms
- Slow query responses (> 1 second)
- Increasing query latency over time
- Timeout errors
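The symptoms above can be checked against latencies recorded on the client side. The following is a minimal sketch, assuming samples are in seconds and in chronological order; the function name and the first-half versus second-half trend comparison are illustrative choices, not part of Vespa.

```python
from statistics import mean

SLOW_THRESHOLD_S = 1.0  # "slow" as defined in the symptoms list


def summarize_latency(samples_s):
    """Summarize chronological latency samples (in seconds).

    Returns the fraction of slow queries and whether the second half of
    the window is slower on average than the first half.
    """
    if len(samples_s) < 2:
        raise ValueError("need at least two samples")
    slow = sum(1 for s in samples_s if s > SLOW_THRESHOLD_S)
    mid = len(samples_s) // 2
    first, second = samples_s[:mid], samples_s[mid:]
    return {
        "slow_fraction": slow / len(samples_s),
        "latency_increasing": mean(second) > mean(first),
    }
```

A rising `slow_fraction` together with `latency_increasing` suggests the problem is growing rather than a one-off spike.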
Common Causes and Solutions
1. Inefficient Rank Profiles
Solution:
2. Thread Pool Saturation
Solution:
3. Large Result Sets
Symptom: Queries with hits=1000 are slow
Solution:
4. Expensive Grouping Operations
Solution: Optimize grouping queries:
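For the expensive grouping case, one common optimization is to cap the number of groups with `max(n)` and to skip regular hits with `limit 0` when only the group counts are needed. The sketch below builds such a YQL string; the helper name and the `products` and `category` names in the usage note are hypothetical.

```python
def build_grouping_query(source, field, max_groups=10):
    """Build a YQL string that returns only group counts.

    `limit 0` skips regular hits; `max(n)` caps the number of groups.
    The source and field names are supplied by the caller.
    """
    if max_groups < 1:
        raise ValueError("max_groups must be positive")
    return (
        f"select * from {source} where true limit 0 | "
        f"all(group({field}) max({max_groups}) each(output(count())))"
    )
```

For example, `build_grouping_query("products", "category", 5)` yields a query that returns at most five groups and no hits.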
Query Errors
Feeding Issues
Slow Document Ingestion
Symptoms
- Low feed throughput (< 100 docs/sec on capable hardware)
- High feed latency
- Growing queue of pending operations
Solutions
1. Use Async Operations
2. Check Resource Limits
If feeding is blocked:
3. Optimize Document Structure
- Remove unnecessary fields
- Use appropriate field types
- Enable compression for large text fields
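The async approach from the first solution can be sketched with `asyncio`: many feed operations run concurrently, while a semaphore caps how many are in flight so the cluster is not flooded. This is an illustration under assumptions, not a Vespa client: `send_document` is a hypothetical coroutine supplied by the caller, and `fake_send` only stands in for a real network call.

```python
import asyncio


async def feed_all(docs, send_document, max_in_flight=32):
    """Feed documents concurrently with a cap on in-flight operations.

    `send_document` is a hypothetical coroutine supplied by the caller,
    for example a wrapper around an HTTP client call.
    """
    semaphore = asyncio.Semaphore(max_in_flight)

    async def feed_one(doc):
        # Wait for a free slot before sending.
        async with semaphore:
            return await send_document(doc)

    # gather() returns results in the same order as the input documents.
    return await asyncio.gather(*(feed_one(d) for d in docs))


async def fake_send(doc):
    # Stand-in for a real network call, used only for demonstration.
    await asyncio.sleep(0)
    return doc["id"]
```

Calling `asyncio.run(feed_all(docs, fake_send))` returns one result per document. Tuning `max_in_flight` trades throughput against pressure on the cluster.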
Feed Failures
Memory Issues
High Memory Usage
Check Attribute Usage
Attributes are stored in memory. Large or high-cardinality attributes consume significant memory.
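A back-of-the-envelope estimate helps decide which attributes to look at first. The sketch below counts raw value storage only, so it is a rough lower bound: it ignores data-structure overhead, string storage, and any index structures, and the function name is illustrative.

```python
def estimate_attribute_bytes(doc_count, bytes_per_value, values_per_doc=1):
    """Rough lower bound on attribute memory: raw value storage only."""
    if min(doc_count, bytes_per_value, values_per_doc) < 0:
        raise ValueError("arguments must be non-negative")
    return doc_count * bytes_per_value * values_per_doc
```

For example, 100 million documents with a single 8-byte numeric attribute need at least 800,000,000 bytes for that attribute alone.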
JVM Memory Issues (Containers)
High GC Overhead
Symptoms:
- GC pauses > 500 ms
- GC consuming > 10% CPU
- Heap consistently > 80% used
Solutions:
- Increase heap size:
- Tune GC:
- Identify memory leaks:
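The GC thresholds listed above can be turned into a simple check. This is a minimal sketch that evaluates a single measurement; the function and argument names are hypothetical, and judging whether heap usage is "consistently" high requires applying it to samples over time.

```python
def gc_warnings(max_pause_ms, gc_cpu_fraction, heap_used_fraction):
    """Return descriptions of the GC symptoms that exceed their threshold."""
    warnings = []
    if max_pause_ms > 500:
        warnings.append("GC pause above 500 ms")
    if gc_cpu_fraction > 0.10:
        warnings.append("GC using more than 10% CPU")
    if heap_used_fraction > 0.80:
        warnings.append("heap above 80% used")
    return warnings
```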
Disk Issues
Disk Full
Slow Disk I/O
Symptoms and Solutions
Symptoms:
- High query latency
- Slow feed operations
- High disk queue depth
Solutions:
- Optimize cache usage:
- Monitor cache hit rates: target hit rate > 90%.
- Use faster storage:
  - NVMe SSDs for best performance
  - Ensure sufficient IOPS provisioned
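The hit-rate target can be checked from raw hit and miss counters. A minimal sketch, assuming the two counters are read from whatever cache metrics the deployment exposes; the function names are illustrative.

```python
def cache_hit_rate(hits, misses):
    """Return hits / (hits + misses), or None when there were no lookups."""
    total = hits + misses
    if total == 0:
        return None
    return hits / total


def meets_target(hits, misses, target=0.90):
    """True when the hit rate is above the target (90% by default)."""
    rate = cache_hit_rate(hits, misses)
    return rate is not None and rate > target
```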
Network Issues
Connection Timeouts
High Connection Count
- Check for connection leaks in clients
- Tune connection timeouts:
Cluster State Issues
Node Down
Common Issues
Out of memory:
Solution: Increase heap size or reduce memory usage
Disk full:
Solution: Free disk space or add capacity
Configuration error:
Solution: Review and fix services.xml
Split Brain / Cluster State Divergence
Solution:
- Check cluster controller:
- Force cluster state update:
- If issues persist, restart cluster controller
Performance Regression
Diagnosis Steps
1. Compare metrics before/after:
2. Review recent changes:
- Configuration changes
- Schema modifications
- Application updates
- Infrastructure changes
4. Profile queries:
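Comparing metrics before and after a change can be sketched as a percent-change check over two snapshots. This assumes metrics where higher is worse (latency, for example); the metric names in the usage note and the 10% threshold are hypothetical.

```python
def regressions(before, after, threshold_pct=10.0):
    """Return metrics that grew by more than threshold_pct percent.

    Assumes higher values are worse. Metrics missing from `after` or
    with a non-positive baseline are skipped.
    """
    flagged = {}
    for name, old in before.items():
        if name in after and old > 0:
            change = (after[name] - old) / old * 100.0
            if change > threshold_pct:
                flagged[name] = round(change, 1)
    return flagged
```

For example, with a baseline of `{"p95_ms": 100.0, "p50_ms": 20.0}` and a current snapshot of `{"p95_ms": 150.0, "p50_ms": 21.0}`, only `p95_ms` is flagged, with a 50.0% increase.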
Debugging Tools
Log Analysis
Metric Queries
Query Tracing
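A trace can be requested by adding the `trace.level` parameter to a query. The sketch below only builds the request URL; the host, port, and helper name are assumptions for a local setup, and the right trace level depends on how much detail is needed.

```python
from urllib.parse import urlencode


def traced_query_url(yql, trace_level=3, base="http://localhost:8080/search/"):
    """Build a search URL that asks the container to include trace output."""
    params = {"yql": yql, "trace.level": trace_level}
    return f"{base}?{urlencode(params)}"
```

Higher trace levels produce more verbose output, so it is usually worth starting low and increasing only as needed.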
Getting Help
Vespa Slack
Join the community for real-time help
GitHub Issues
Report bugs and request features
Stack Overflow
Search existing questions or ask new ones
Documentation
Browse official documentation
Next Steps
Monitoring
Set up proactive monitoring
Tuning
Optimize performance
Scaling
Scale your cluster