Health Check Endpoints
Mission Control exposes three health check endpoints for monitoring and infrastructure orchestration.Available Endpoints
All health endpoints are defined inbackend/app/main.py:500-551.
GET /health
Lightweight liveness probe endpoint.
GET /healthz
Alias liveness probe endpoint for platform compatibility (Kubernetes-style naming).
GET /readyz
Readiness probe endpoint for service orchestration checks. Returns 200 when the service is ready to accept traffic.
System Status Verification
Backend Health
Verify the backend is responding:CORS Configuration
Verify that the correct origin is permitted:Access-Control-Allow-Origin in the response headers.
User Membership and Role
Verify user authentication and role (should returnrole: "owner" for admin access):
List Agents
Verify agent provisioning status:Gateway Status
Check the gateway runtime status via the Mission Control API:backend/app/api/gateway_sessions.py and proxies to the OpenClaw gateway health check.
Monitoring Active Sessions
List active gateway sessions:GET /api/v1/gateway/sessions
Returns:
- Active session keys
- Session labels
- Agent associations
- Last activity timestamps
Session Detail
Get details for a specific session:Session Chat History
Retrieve the chat history for a session:Metrics Endpoint
Mission Control provides aggregated operational metrics via the/api/v1/metrics/dashboard endpoint.
Implementation: backend/app/api/metrics.py:422-484
Query Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
range | 24h | 3d | 7d | 14d | 1m | 3m | 6m | 1y | 24h | Time range for metrics |
board_id | UUID | null | Filter by board |
group_id | UUID | null | Filter by board group |
Response Structure
The endpoint returns: KPIs:active_agents: Number of agents active in the time rangetasks_in_progress: Current count of in-progress taskserror_rate_pct: Percentage of failed activity eventsmedian_cycle_time_hours_7d: Median hours fromin_progress_attodone
throughput: Tasks moved to review status over timecycle_time: Average task completion timeerror_rate: Percentage of errors over timewip: Work-in-progress breakdown by status (inbox, in_progress, review, done)
primary (current range) and comparison (previous range) data.
Service Logs
View live logs for each service:Backend Logs
Frontend Logs
Worker Logs
Service Management
Check Status
Restart Services
Restart a single service:Database Monitoring
Connect to PostgreSQL directly for debugging:Monitoring Best Practices
- Health Checks: Use
/healthzfor liveness and/readyzfor readiness probes - Authentication: Always verify CORS and token configuration if users report access issues
- Logs: Use
journalctl -ffor real-time debugging,-n 100for recent logs - Metrics: Use the dashboard endpoint to track throughput, cycle time, and error rates
- Sessions: Monitor active sessions to verify agents are running and connected