Prerequisites
Before running your first investigation:- Connect at least one cloud provider (AWS, GCP, or Azure)
- Set up an observability integration (Datadog, Grafana, PagerDuty, or Netdata)
- Configure your LLM provider (OpenAI, Anthropic, or OpenRouter)
How Incidents Are Created
Aurora creates incidents automatically when:- Alert webhooks trigger from your observability tools
- Manual investigations are started from the UI
- Correlated alerts are grouped into existing incidents
- A unique incident ID
- Alert metadata (title, severity, service, environment)
- A dedicated chat session for investigation
- Real-time thought streaming
Starting an Investigation
Trigger an alert
Send a test alert from your observability platform. Aurora will automatically:
- Receive the webhook
- Create an incident in the database
- Start a background RCA (Root Cause Analysis) task
View the incident
Navigate to the Incidents page to see your new incident:The incident will show:
- Status:
investigating,analyzed, orresolved - Aurora Status:
idle,running,complete, orerror - Alert Details: title, service, severity, source type
Watch the investigation
Click into the incident to see Aurora’s investigation in real-time:
- Thoughts Tab: Streaming analysis from the LangGraph agent
- Suggestions Tab: Diagnostic commands and fix recommendations
- Citations: Evidence from executed commands (logs, metrics, traces)
Understanding Investigation Output
Thoughts
Thoughts are real-time updates stored inincident_thoughts table:
analysis- Investigation reasoningobservation- Findings from toolshypothesis- Potential root causes
Suggestions
Suggestions are actionable recommendations stored inincident_suggestions:
filePath- File to modifysuggestedContent- Proposed changesrepository- Target repo for PR
Citations
Citations are evidence from tool executions:Chatting with Aurora
You can ask follow-up questions during or after an investigation:
Chat sessions are stored in
chat_sessions and linked via incident_id.
Resolving an Incident
When the issue is fixed:Troubleshooting
Investigation not starting
Investigation not starting
Check Celery worker logs:Verify:
- Redis is running (Celery broker)
- LLM API keys are configured
- Cloud provider credentials are valid
No thoughts appearing
No thoughts appearing
Ensure the WebSocket connection is active:Check
auroraStatus - if error, view worker logs for exceptions.Suggestions failing to execute
Suggestions failing to execute
Diagnostic commands require:
- Valid cloud provider credentials
- Proper IAM/RBAC permissions
- Network access to resources (kubectl, AWS CLI, etc.)
Incident not correlating alerts
Incident not correlating alerts
Alert correlation uses:
- Service name matching
- Time window (5-15 minutes)
- Severity thresholds
incident_alerts table for correlation details:Next Steps
Connect More Sources
Add AWS, GCP, or Azure for deeper investigation capabilities
Set Up Monitoring
Configure Datadog, Grafana, or other observability integrations
Custom Connectors
Build integrations for proprietary systems
Backup & Restore
Set up automated backups for incident data