Overview
SLA (Service Level Agreement) tracking helps you monitor whether your services meet uptime and performance commitments. Pongo automatically calculates uptime percentages and displays them on dashboards alongside your defined SLA targets.Configuring SLA Targets
Set SLA targets at the dashboard level:Common SLA Targets
| SLA Target | Downtime per Year | Downtime per Month | Downtime per Day |
|---|---|---|---|
| 99% | 3.65 days | 7.31 hours | 14.40 minutes |
| 99.5% | 1.83 days | 3.65 hours | 7.20 minutes |
| 99.9% | 8.77 hours | 43.83 minutes | 1.44 minutes |
| 99.95% | 4.38 hours | 21.92 minutes | 43.20 seconds |
| 99.99% | 52.60 minutes | 4.38 minutes | 8.64 seconds |
| 99.999% | 5.26 minutes | 26.30 seconds | 0.86 seconds |
Uptime Calculation
Pongo calculates uptime percentage using check results:Status Classification
- Up: Counts toward successful checks
- Down: Counts as failure
- Degraded: Configurable (typically counts as partial success)
Time Windows
Uptime is calculated across multiple time windows:- Last 24 hours
- Last 7 days
- Last 30 days
- Last 90 days
- All time
Response Time Tracking
Beyond uptime, SLA tracking includes response time metrics:Latency Percentiles
- P50 (median): 50% of requests are faster than this
- P95: 95% of requests are faster than this (excludes outliers)
- P99: 99% of requests are faster than this (catches slowest 1%)
Average Response Time
Calculated across all successful checks in the time window.Status Distribution
Dashboards display how much time services spend in each state:- Chronic issues: High percentage of “down” time
- Performance problems: Frequent “degraded” status
- Stability: Consistent “up” status
Dashboard Display
SLA Indicator
Dashboards show current uptime against the target:- Green checkmark: Meeting SLA
- Red warning: Below SLA target
Visual Components
- Uptime Badge: Prominent display of current uptime percentage
- SLA Progress Bar: Visual indicator of target achievement
- Status Distribution Pie Chart: Breakdown of up/degraded/down time
- Response Time Chart: Historical response time trends with P50/P95/P99 lines
- Uptime Bars: Color-coded timeline showing service status over time
Monitor-Level Metrics
Each monitor in a dashboard tracks its own metrics:- Identify which services are impacting overall SLA
- Prioritize improvements based on worst performers
- Track individual service commitments
Setting Performance Thresholds
Use monitor handlers to define what constitutes “degraded” vs “up”:Alert Integration
Combine SLA tracking with alerts to notify when targets are at risk:Public Status Page Display
Whenpublic: true, status pages show:
- Current uptime percentage with SLA target
- Response time charts with historical trends
- Latency percentiles (P50, P95, P99)
- Status distribution pie chart
- Uptime bars showing 90-day history
Best Practices
Set Realistic Targets
Start with achievable SLA targets (99% or 99.5%) and increase as reliability improves.
Include Degraded States
Don’t just track up/down—use “degraded” for slow responses or partial outages.
Monitor Multiple Time Windows
Track 24h, 7d, and 30d uptime to spot both acute and chronic issues.
Alert Before Breach
Set up early warning alerts before you’re in danger of missing SLA commitments.
Calculating SLA Credits
Many SLA agreements include service credits for downtime. Use check results to calculate:Related
Dashboards
Configure dashboards with SLA targets
Monitors
Create monitors with performance thresholds
Alerts
Set up alerts for SLA breach risks
Status Pages
Display SLA metrics on public status pages