
DevOps & Infrastructure Components

The DevOps & Infrastructure category includes specialists for cloud platforms, container orchestration, infrastructure as code, CI/CD pipelines, and site reliability engineering.

Browse DevOps Components

Explore all DevOps and infrastructure components at aitmpl.com

Quick Install

# Install DevOps essentials
npx claude-code-templates@latest \
  --agent devops-engineer \
  --agent cloud-architect \
  --agent kubernetes-specialist \
  --agent terraform-engineer

# Or install the DevOps template
npx claude-code-templates@latest --template devops-pipeline
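
The multi-agent command above repeats `--agent` once per agent. If you keep your agent list somewhere else (a file, a variable), a small wrapper can assemble the same command. This is only a sketch: `build_install_cmd` is a hypothetical helper, not part of claude-code-templates, and it prints the command instead of running it so you can review it first.

```shell
# Hypothetical helper: prints (does not run) an install command that
# repeats the --agent flag once for each agent name passed in.
build_install_cmd() {
  cmd="npx claude-code-templates@latest"
  for agent in "$@"; do
    cmd="$cmd --agent $agent"
  done
  printf '%s\n' "$cmd"
}

# Example: the four agents from the quick-install command.
build_install_cmd devops-engineer cloud-architect kubernetes-specialist terraform-engineer
```

Once the printed command looks right, paste it into your terminal to run the install.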

Core DevOps Agents

DevOps Engineer

Complete DevOps specialist for infrastructure and deployment
Expertise:
  • CI/CD pipeline design and implementation
  • Container orchestration (Docker, Kubernetes)
  • Infrastructure as Code (Terraform, Pulumi)
  • Cloud platforms (AWS, Azure, GCP)
  • Monitoring and observability
  • Incident response
Technologies:
  • Docker, Kubernetes, Helm
  • GitHub Actions, GitLab CI, Jenkins
  • Terraform, Ansible, Pulumi
  • Prometheus, Grafana, DataDog
  • AWS/Azure/GCP CLIs and SDKs
npx claude-code-templates@latest --agent devops-engineer
Example Usage:
@devops-engineer Set up a production-ready Kubernetes cluster
with auto-scaling, monitoring, and CI/CD pipeline for our
microservices application.

Cloud Architect

Multi-cloud architecture and infrastructure design
Expertise:
  • Cloud architecture patterns
  • Multi-cloud and hybrid cloud
  • Cost optimization
  • Security and compliance
  • High availability design
  • Disaster recovery
Platforms:
  • AWS (EC2, S3, RDS, Lambda, ECS, EKS)
  • Azure (VMs, Blob Storage, SQL, Functions, AKS)
  • GCP (Compute Engine, Cloud Storage, Cloud Run, GKE)
npx claude-code-templates@latest --agent cloud-architect

Platform Engineer

Platform engineering and developer experience
Expertise:
  • Internal developer platforms
  • Developer tooling
  • Self-service infrastructure
  • Platform observability
  • Golden paths
npx claude-code-templates@latest --agent platform-engineer

SRE Engineer

Site Reliability Engineering for production systems
Expertise:
  • Service Level Objectives (SLOs)
  • Incident response and postmortems
  • Reliability engineering
  • Capacity planning
  • Automation and toil reduction
  • Production debugging
npx claude-code-templates@latest --agent sre-engineer

Container Orchestration

Kubernetes Specialist

Kubernetes deployment, scaling, and operations
Expertise:
  • Kubernetes architecture
  • Deployments, StatefulSets, DaemonSets
  • Services, Ingress, NetworkPolicies
  • ConfigMaps and Secrets management
  • Helm charts and Kustomize
  • Service mesh (Istio, Linkerd)
  • Autoscaling (HPA, VPA, Cluster Autoscaler)
  • Monitoring and logging
Use cases:
  • Microservices deployment
  • Stateful application management
  • Multi-tenant clusters
  • Blue-green and canary deployments
npx claude-code-templates@latest --agent kubernetes-specialist
Example Usage:
@kubernetes-specialist Deploy our microservices to Kubernetes
with:
- Horizontal Pod Autoscaling based on CPU and custom metrics
- Ingress with TLS termination
- Network policies for pod-to-pod communication
- Prometheus monitoring
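
For the autoscaling requested in this prompt, it helps to know what replica count to expect. The Horizontal Pod Autoscaler's core rule, as documented by Kubernetes, is desiredReplicas = ceil(currentReplicas × currentMetric / targetMetric). The sketch below reproduces only that arithmetic. It leaves out tolerance, stabilization windows, and min/max bounds, and `hpa_desired_replicas` is an illustrative name.

```shell
# Illustrative only: the core HPA scaling rule,
#   desired = ceil(current_replicas * current_metric / target_metric)
# using integer arithmetic. Tolerance, stabilization windows, and
# minReplicas/maxReplicas clamping are deliberately left out.
hpa_desired_replicas() {
  current_replicas=$1
  current_metric=$2   # e.g. observed average CPU utilization, in percent
  target_metric=$3    # e.g. target average CPU utilization, in percent
  # Integer ceiling division: (a + b - 1) / b
  echo $(( (current_replicas * current_metric + target_metric - 1) / target_metric ))
}

hpa_desired_replicas 4 85 50   # 4 pods at 85% CPU against a 50% target
```

With these inputs the function prints 7: 4 × 85 / 50 = 6.8, rounded up.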

Microservices Architect

Microservices architecture and design patterns
npx claude-code-templates@latest --agent microservices-architect

Infrastructure as Code

Terraform Specialists

Terraform Engineer

Infrastructure as Code with Terraform
Expertise:
  • Terraform configuration
  • Module development
  • State management
  • Multi-environment setups
  • Provider configuration
  • Best practices
npx claude-code-templates@latest --agent terraform-engineer

Terraform Specialist

Advanced Terraform patterns and optimization
npx claude-code-templates@latest --agent terraform-specialist

Terragrunt Expert

Terragrunt for DRY Terraform configurations
npx claude-code-templates@latest --agent terragrunt-expert

Terraform IaC Reviewer

Review Terraform code for best practices
npx claude-code-templates@latest --agent terraform-iac-reviewer

Deployment & CI/CD

Deployment Engineer

Deployment automation and release management
Expertise:
  • Deployment strategies (blue-green, canary, rolling)
  • Release automation
  • Rollback procedures
  • Feature flags
  • Progressive delivery
npx claude-code-templates@latest --agent deployment-engineer

Vercel Deployment Specialist

Vercel platform deployment and optimization
npx claude-code-templates@latest --agent vercel-deployment-specialist

Monitoring & Operations

Monitoring Specialist

Observability, monitoring, and alerting
Expertise:
  • Metrics collection (Prometheus, DataDog, CloudWatch)
  • Logging (ELK, Loki, CloudWatch Logs)
  • Distributed tracing (Jaeger, Zipkin, Tempo)
  • Dashboards and visualization
  • Alerting and on-call
  • SLO/SLI definition
npx claude-code-templates@latest --agent monitoring-specialist

Incident Responder

Incident response and troubleshooting
npx claude-code-templates@latest --agent incident-responder

DevOps Incident Responder

DevOps-focused incident management
npx claude-code-templates@latest --agent devops-incident-responder

DevOps Troubleshooter

Production troubleshooting and debugging
npx claude-code-templates@latest --agent devops-troubleshooter

Security Engineering

Security Engineer

Infrastructure security and compliance
Expertise:
  • Security architecture
  • Compliance (SOC2, HIPAA, GDPR)
  • Vulnerability management
  • Security automation
  • Penetration testing
  • Zero trust architecture
npx claude-code-templates@latest --agent security-engineer

Network Engineer

Network architecture and security
npx claude-code-templates@latest --agent network-engineer

Specialized Infrastructure Agents

Azure Logic Apps Expert

Azure Logic Apps integration and automation
npx claude-code-templates@latest --agent azure-logic-apps-expert

Azure Verified Modules Bicep

Azure Verified Modules with Bicep
npx claude-code-templates@latest --agent azure-verified-modules-bicep

Azure Verified Modules Terraform

Azure Verified Modules with Terraform
npx claude-code-templates@latest --agent azure-verified-modules-terraform

M365 Admin

Microsoft 365 administration
npx claude-code-templates@latest --agent m365-admin

Kusto Assistant

Azure Data Explorer (Kusto) queries
npx claude-code-templates@latest --agent kusto-assistant

Windows Infrastructure Admin

Windows server infrastructure
npx claude-code-templates@latest --agent windows-infra-admin

DevOps Workflows

Complete CI/CD Pipeline

# 1. Infrastructure setup
@terraform-engineer Create Terraform configuration for:
- Kubernetes cluster (EKS/AKS/GKE)
- Container registry
- Monitoring stack
- Load balancer

# 2. Kubernetes deployment
@kubernetes-specialist Create Kubernetes manifests for:
- Microservices deployment
- Ingress configuration
- ConfigMaps and Secrets
- HorizontalPodAutoscaler

# 3. CI/CD pipeline
@devops-engineer Set up GitHub Actions workflow for:
- Build and test
- Docker image build and push
- Kubernetes deployment
- Automated rollback on failure

# 4. Monitoring
@monitoring-specialist Set up:
- Prometheus metrics
- Grafana dashboards
- Alert rules for SLOs
- On-call rotation
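
The "Automated rollback on failure" item in step 3 can be sketched as a shell function that a pipeline step might call. It is a minimal illustration, not what the agent will necessarily generate. It assumes `kubectl` is already authenticated against the target cluster, and the function name, the 120-second timeout, and the example image reference are all illustrative.

```shell
# Sketch: update a Deployment's image, wait for the rollout, and undo it
# if the rollout does not complete in time. The function name and the
# 120s timeout are illustrative choices.
deploy_with_rollback() {
  deployment=$1   # Deployment name, e.g. "api"
  container=$2    # container name inside the pod template
  image=$3        # fully qualified image reference

  kubectl set image "deployment/${deployment}" "${container}=${image}" || return 1

  if kubectl rollout status "deployment/${deployment}" --timeout=120s; then
    echo "rollout of ${deployment} succeeded"
  else
    echo "rollout of ${deployment} failed, rolling back" >&2
    kubectl rollout undo "deployment/${deployment}"
    return 1
  fi
}

# Usage (hypothetical names):
#   deploy_with_rollback api api registry.example.com/api:1.2.3
```

Returning a non-zero status after the rollback keeps the pipeline run marked as failed, so the bad release stays visible even though the cluster has been restored.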

Infrastructure as Code Workflow

# 1. Plan infrastructure
@cloud-architect Design AWS infrastructure for microservices

# 2. Generate Terraform
@terraform-engineer Create Terraform modules for:
- VPC and networking
- EKS cluster
- RDS database
- S3 buckets
- IAM roles

# 3. Review
@terraform-iac-reviewer Review Terraform code for best practices

# 4. Apply
@devops-engineer Apply Terraform and verify infrastructure
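
Step 4 typically comes down to a short sequence of Terraform CLI commands. The sketch below shows one common ordering, assuming it runs from the directory that holds the reviewed configuration. The function name and the `tfplan` file name are illustrative.

```shell
# Sketch of a plan-then-apply sequence; stops at the first failing step.
# Assumes the current directory holds the Terraform configuration.
terraform_plan_and_apply() {
  terraform init -input=false &&
  terraform fmt -check -recursive &&
  terraform validate &&
  terraform plan -input=false -out=tfplan &&
  terraform apply -input=false tfplan
}
```

Applying the saved plan file means the changes that run are exactly the ones that were planned, which fits the review step that comes before it.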

Kubernetes Migration

# 1. Architecture
@microservices-architect Design microservices architecture

# 2. Containerization
@devops-engineer Containerize existing applications

# 3. Kubernetes setup
@kubernetes-specialist:
- Create Kubernetes manifests
- Set up namespaces and RBAC
- Configure networking
- Set up persistent storage

# 4. Migration
@deployment-engineer Plan and execute migration with zero downtime

Best Practices

1. Infrastructure as Code

# Version control everything
@terraform-engineer Store all infrastructure as code in Git

# Use modules for reusability
@terraform-engineer Create reusable Terraform modules

# Implement GitOps
@devops-engineer Set up ArgoCD for Kubernetes deployments

2. Observability

# Three pillars of observability
@monitoring-specialist Set up:
- Metrics (Prometheus)
- Logs (Loki/ELK)
- Traces (Tempo/Jaeger)

# Define SLOs
@sre-engineer Define Service Level Objectives:
- Availability: 99.9%
- Latency: p95 < 200ms
- Error rate: < 0.1%
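
To make the availability target concrete, it can be converted into allowed downtime. The sketch below does that arithmetic. The 30-day window is an assumption, since the SLO list does not state a measurement window, and `allowed_downtime_minutes` is an illustrative name.

```shell
# Worked arithmetic: how much downtime an availability SLO allows.
# The 30-day window used in the example is an assumption; the SLO
# figures do not state a measurement window.
allowed_downtime_minutes() {
  slo_percent=$1   # e.g. 99.9
  window_days=$2   # e.g. 30
  LC_ALL=C awk -v slo="$slo_percent" -v days="$window_days" \
    'BEGIN { printf "%.1f\n", days * 24 * 60 * (100 - slo) / 100 }'
}

allowed_downtime_minutes 99.9 30
```

This prints 43.2: a 99.9% target over 30 days (43,200 minutes) leaves about 43 minutes of downtime.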

3. Security

# Security scanning
@security-engineer Implement:
- Container image scanning
- Infrastructure vulnerability scanning
- Secrets management (Vault, AWS Secrets Manager)
- Network policies

# Compliance
@security-engineer Ensure compliance with SOC2/HIPAA/GDPR

4. Cost Optimization

# Right-sizing
@cloud-architect Analyze and optimize resource usage

# Auto-scaling
@kubernetes-specialist Implement HPA and cluster autoscaling

# Reserved instances
@cloud-architect Purchase reserved instances for predictable workloads

Real-World Examples

Example 1: Microservices on Kubernetes

# Install specialists
npx claude-code-templates@latest \
  --agent kubernetes-specialist \
  --agent microservices-architect \
  --agent monitoring-specialist

# Deploy
@microservices-architect Design service boundaries and communication
@kubernetes-specialist Deploy services with:
- Service mesh (Istio) for traffic management
- mTLS for secure communication
- Distributed tracing
- Canary deployments
@monitoring-specialist Set up observability stack

Example 2: Multi-Cloud Infrastructure

# Install cloud specialists
npx claude-code-templates@latest \
  --agent cloud-architect \
  --agent terraform-engineer \
  --agent devops-engineer

# Design and implement
@cloud-architect Design multi-cloud architecture with:
- Primary workloads on AWS
- DR site on Azure
- CDN on Cloudflare

@terraform-engineer Implement with Terraform Cloud

Example 3: SRE Implementation

# Install SRE specialists
npx claude-code-templates@latest \
  --agent sre-engineer \
  --agent monitoring-specialist \
  --agent incident-responder

# Implement SRE practices
@sre-engineer:
- Define SLOs and error budgets
- Set up monitoring and alerting
- Create runbooks
- Implement chaos engineering
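
An error budget is the amount of failure an SLO tolerates. For a request-based SLO, the share of budget already spent is the number of failed requests divided by the number of failures the target allows. The sketch below works that out. The request counts and the 99.9% target in the example are made-up inputs, and the function name is illustrative.

```shell
# Sketch: share of a request-based error budget already spent.
# budget = total_requests * (100 - slo) / 100 allowed failures.
error_budget_consumed_percent() {
  total_requests=$1
  failed_requests=$2
  slo_percent=$3    # success-rate target, e.g. 99.9
  LC_ALL=C awk -v total="$total_requests" -v failed="$failed_requests" -v slo="$slo_percent" \
    'BEGIN {
       budget = total * (100 - slo) / 100   # failures the SLO tolerates
       if (budget <= 0) { print "no budget"; exit }
       printf "%.0f\n", failed / budget * 100
     }'
}

# Made-up inputs: 1,000,000 requests, 400 failures, 99.9% target.
error_budget_consumed_percent 1000000 400 99.9
```

This prints 40: the target tolerates 1,000 failures per million requests, and 400 of them have been used.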

DevOps Template

npx claude-code-templates@latest --template devops-pipeline
Includes:
  • DevOps engineer agent
  • Cloud architect agent
  • Kubernetes specialist agent
  • Terraform engineer agent
  • Monitoring specialist agent
  • Deployment automation
  • Infrastructure monitoring

