Overview
The/siaa/status endpoint provides comprehensive information about the SIAA system’s health, including Ollama availability, cache statistics, active users, document index status, and configuration parameters.
Endpoint
Request
No parameters required.Response
Current SIAA proxy version (e.g.,
"2.1.25")Overall system state:
"ok" or "error"Cache statistics object:
Current number of cached responses
Maximum cache capacity (200)
Total cache hits since startup
Total cache misses since startup
Cache hit rate percentage (e.g.,
"38.5%")Time-to-live for cache entries in seconds (3600)
Ollama service availability:
true if available, false if downNumber of consecutive Ollama health check failures
Active LLM model name (e.g.,
"qwen2.5:3b")Indicates if the model has been preloaded into RAM:
true: Model loaded successfullyfalse: Warmup failednull: Warmup not yet attempted
Number of currently active concurrent requests
Total requests served since startup
Total number of documents loaded in the system
Total number of pre-computed document chunks across all documents
Number of unique terms in the density index
Maximum chunk size in characters (800)
Overlap between consecutive chunks in characters (300)
Example Response
Usage Examples
Check System Health
Monitor Cache Performance
Check Ollama Status
List All Collections and Documents
Monitor Active Load
Status Interpretation
Healthy System
estado:"ok"ollama:trueollama_fallos:0warmup_completado:truecache.hit_rate: Above 20% indicates effective caching
Warning Signs
ollama_fallos> 0: Ollama service is experiencing connectivity issueswarmup_completado:false: Model failed to load into RAM, first queries will be slowusuarios_activospersistently high: System may be under heavy loadcache.hit_rate< 10%: Cache is not effective (queries are too diverse)
Critical Issues
estado:"error": Overall system failureollama:false: LLM service unavailable, chat endpoint will return errorstotal_documentos:0: No documents loaded, document queries will fail
Monitoring and Alerts
The status endpoint is designed for:- Health checks: Load balancer probes should check
estado === "ok" - Metrics collection: Export to Prometheus, Grafana, or similar
- Capacity planning: Track
total_atendidosandusuarios_activostrends - Cache optimization: Monitor
hit_rateto evaluate query patterns
Related Endpoints
- GET /siaa/cache - Detailed cache statistics and cache clearing
- GET /siaa/log - Quality monitoring and query logs
- GET /siaa/recargar - Reload document index