Overview
Generate a comprehensive MLOps Strategy document that defines how ML/AI models will be developed, deployed, monitored, and governed throughout their lifecycle. Ensures AI systems are reliable, reproducible, and compliant with governance requirements.When to Use
Use/arckit.mlops when your project includes:
- Machine Learning models (classification, regression, NLP, computer vision)
- Large Language Models (LLMs) or Generative AI
- Algorithmic decision-making systems
- AI-assisted automation
/arckit.requirements- understand ML use cases/arckit.data-model- understand training data/arckit.ai-playbook- governance context (UK Gov)
Usage
What It Creates
File:projects/{project}/ARC-{PID}-MLOP-v1.0.md
Sections:
- ML Use Cases - Model purpose, inputs, outputs, performance targets
- Model Lifecycle - Training, validation, deployment, monitoring, retirement
- MLOps Maturity - Level 0-4 assessment and target state
- Training Pipeline - Data prep, feature engineering, training, evaluation
- Serving Infrastructure - Batch vs real-time, scaling, latency SLAs
- Feature Store - Feature engineering, storage, serving
- Model Monitoring - Drift detection, performance degradation, fairness
- Model Registry - Versioning, lineage, metadata
- Responsible AI - Bias testing, explainability, human oversight
- Governance - UK AI Playbook, ATRS, MOD JSP 936 (if applicable)
- Cost Management - Training costs, inference costs, optimization
MLOps Maturity Levels
| Level | Characteristics | Automation |
|---|---|---|
| Level 0 | Manual process, scripts, notebooks | None |
| Level 1 | ML pipeline automation | Training |
| Level 2 | CI/CD for ML pipelines | Training + Deployment |
| Level 3 | Automated monitoring and retraining | Full lifecycle |
| Level 4 | Self-healing, auto-scaling, A/B testing | Autonomous |
Supported ML Platforms
- AWS SageMaker - Managed training, hosting, Pipelines
- Google Vertex AI - Unified ML platform, AutoML, Workbench
- Azure Machine Learning - Designer, Pipelines, Endpoints
- MLflow - Open source experiment tracking and serving
- Kubeflow - Kubernetes-native ML workflows
UK Government AI Compliance
Integrates with:/arckit.ai-playbook- 10 AI principles, 6 ethical themes/arckit.atrs- Algorithmic Transparency Recording Standard/arckit.jsp-936- MOD AI assurance (defence projects)
Related Commands
AI Playbook
UK Government AI Playbook compliance
DevOps
CI/CD pipelines for model deployment
FinOps
ML infrastructure cost optimization
Data Model
Training data schema and governance