Overview
Generate a comprehensive Operational Readiness Pack that prepares a service for production operation. Bridges the gap between development completion and live service, ensuring operations teams have everything needed to support the service.When to Use
Use/arckit.operationalize before go-live to ensure operational readiness.
Run after:
/arckit.requirements- for SLA targets/arckit.diagram- for component inventory/arckit.hld-reviewor/arckit.dld-review- for technical details/arckit.data-model- for data dependencies
Complementary to
/arckit.servicenow (ITSM tooling) - this command focuses on operational practices and documentation.Usage
What It Creates
File:projects/{project}/ARC-{PID}-OPER-v1.0.md
Sections:
- Service Overview - Purpose, criticality, SLA summary
- Support Model - Support tiers, escalation paths, on-call schedule
- SLIs and SLOs - Service Level Indicators, Objectives, Error Budgets
- Runbooks - Standard operating procedures for common tasks
- Incident Response - Playbooks for incident categories
- Monitoring and Alerting - Metrics, dashboards, alert thresholds
- Disaster Recovery - Backup/restore procedures, RTO/RPO
- Business Continuity - Failover plans, degraded mode operation
- Change Management - Deployment procedures, rollback plans
- Capacity Management - Scaling triggers, resource planning
- Handover Documentation - Onboarding guides, knowledge transfer
- Toil Analysis - Repetitive tasks, automation opportunities
Service Tiers
| Tier | Availability Target | Support Hours | Max Incident Response |
|---|---|---|---|
| Critical | 99.95% | 24/7/365 | 15 minutes |
| Important | 99.5% | 16/5 | 1 hour |
| Standard | 99.0% | 9/5 | 4 hours |
SRE Principles
Follows Site Reliability Engineering best practices:- SLIs - Quantitative service quality measures (latency, availability, error rate)
- SLOs - Target values for SLIs (e.g., 99.9% availability)
- Error Budgets - Acceptable downtime to balance reliability vs velocity
- Toil Reduction - Automate repetitive manual operational work
UK Government Compliance
- GDS Service Standard Point 14 - Operate a reliable service
- TCoP Point 6 - Make things secure (operational security)
- ITIL v4 - Incident, Problem, Change, Capacity Management
Related Commands
ServiceNow
ITSM tooling design (CMDB, SLAs, incidents)
DevOps
CI/CD pipelines and automation
FinOps
Cloud cost management and optimization
Diagram
Architecture diagrams for runbooks