GET /siaa/status

Overview

The /siaa/status endpoint provides comprehensive information about the SIAA system’s health, including Ollama availability, cache statistics, active users, document index status, and configuration parameters.

Endpoint

GET /siaa/status

Request

No parameters required.

Response

version

string

Current SIAA proxy version (e.g., "2.1.25")

estado

string

Overall system state: "ok" or "error"

cache

object

Cache statistics object:

entradas

number

Current number of cached responses

max

number

Maximum cache capacity (200)

hits

number

Total cache hits since startup

misses

number

Total cache misses since startup

hit_rate

string

Cache hit rate percentage (e.g., "38.5%")

ttl_seg

number

Time-to-live for cache entries in seconds (3600)

ollama

boolean

Ollama service availability: true if available, false if down

ollama_fallos

number

Number of consecutive Ollama health check failures

modelo

string

Active LLM model name (e.g., "qwen2.5:3b")

warmup_completado

boolean | null

Indicates if the model has been preloaded into RAM:

true: Model loaded successfully
false: Warmup failed
null: Warmup not yet attempted

usuarios_activos

number

Number of currently active concurrent requests

total_atendidos

number

Total requests served since startup

total_documentos

number

Total number of documents loaded in the system

total_chunks

number

Total number of pre-computed document chunks across all documents

indice_terminos

number

Number of unique terms in the density index

chunk_size

number

Maximum chunk size in characters (800)

chunk_overlap

number

Overlap between consecutive chunks in characters (300)

colecciones

object

Object mapping collection names to their document lists and totals:

[collection_name]

object

docs

array

Array of document filenames in this collection

total

number

Total documents in this collection

Example Response

{
  "version": "2.1.25",
  "estado": "ok",
  "cache": {
    "entradas": 87,
    "max": 200,
    "hits": 245,
    "misses": 392,
    "hit_rate": "38.5%",
    "ttl_seg": 3600
  },
  "ollama": true,
  "ollama_fallos": 0,
  "modelo": "qwen2.5:3b",
  "warmup_completado": true,
  "usuarios_activos": 2,
  "total_atendidos": 1847,
  "total_documentos": 18,
  "total_chunks": 486,
  "indice_terminos": 12847,
  "chunk_size": 800,
  "chunk_overlap": 300,
  "colecciones": {
    "general": {
      "docs": [
        "acuerdo_no._psaa16-10476.md",
        "acuerdo_pcsja19-11207.md",
        "circular_cendoj_10-2022.md"
      ],
      "total": 3
    },
    "juzgados": {
      "docs": [
        "juzgado_civil_municipal.md",
        "juzgado_penal_circuito.md",
        "juzgado_familia.md"
      ],
      "total": 3
    }
  }
}

Usage Examples

Check System Health

curl http://localhost:5000/siaa/status

Monitor Cache Performance

curl http://localhost:5000/siaa/status | jq '.cache'

Example output:

{
  "entradas": 87,
  "max": 200,
  "hits": 245,
  "misses": 392,
  "hit_rate": "38.5%",
  "ttl_seg": 3600
}

Check Ollama Status

curl http://localhost:5000/siaa/status | jq '{ollama, modelo, warmup_completado, ollama_fallos}'

Example output:

{
  "ollama": true,
  "modelo": "qwen2.5:3b",
  "warmup_completado": true,
  "ollama_fallos": 0
}

List All Collections and Documents

curl http://localhost:5000/siaa/status | jq '.colecciones'

Monitor Active Load

watch -n 2 'curl -s http://localhost:5000/siaa/status | jq "{usuarios_activos, total_atendidos}"'

This command refreshes every 2 seconds to show current load.

Status Interpretation

Healthy System

estado: "ok"
ollama: true
ollama_fallos: 0
warmup_completado: true
cache.hit_rate: Above 20% indicates effective caching

Warning Signs

ollama_fallos > 0: Ollama service is experiencing connectivity issues
warmup_completado: false: Model failed to load into RAM, first queries will be slow
usuarios_activos persistently high: System may be under heavy load
cache.hit_rate < 10%: Cache is not effective (queries are too diverse)

Critical Issues

estado: "error": Overall system failure
ollama: false: LLM service unavailable, chat endpoint will return errors
total_documentos: 0: No documents loaded, document queries will fail

Monitoring and Alerts

The status endpoint is designed for:

Health checks: Load balancer probes should check estado === "ok"
Metrics collection: Export to Prometheus, Grafana, or similar
Capacity planning: Track total_atendidos and usuarios_activos trends
Cache optimization: Monitor hit_rate to evaluate query patterns

GET /siaa/cache - Detailed cache statistics and cache clearing
GET /siaa/log - Quality monitoring and query logs
GET /siaa/recargar - Reload document index

Endpoints

Utility Endpoints

GET /siaa/status

Overview

Endpoint

Request

Response

Example Response

Usage Examples

Check System Health

Monitor Cache Performance

Check Ollama Status

List All Collections and Documents

Monitor Active Load

Status Interpretation

Healthy System

Warning Signs

Critical Issues

Monitoring and Alerts

Build docs developers (and LLMs) love

Endpoints

Utility Endpoints

​Overview

​Endpoint

​Request

​Response

​Example Response

​Usage Examples

​Check System Health

​Monitor Cache Performance

​Check Ollama Status

​List All Collections and Documents

​Monitor Active Load

​Status Interpretation

​Healthy System

​Warning Signs

​Critical Issues

​Monitoring and Alerts

​Related Endpoints

Build docs developers (and LLMs) love

Overview

Endpoint

Request

Response

Example Response

Usage Examples

Check System Health

Monitor Cache Performance

Check Ollama Status

List All Collections and Documents

Monitor Active Load

Status Interpretation

Healthy System

Warning Signs

Critical Issues

Monitoring and Alerts

Related Endpoints