Deployment Architecture Overview

ZenML provides a flexible deployment architecture that supports both local development and production-scale deployments. Understanding the deployment options helps you choose the right approach for your MLOps infrastructure.

Architecture Components

A ZenML deployment consists of several key components:

ZenML Server

The ZenML server is the central component that provides:

REST API: Exposes endpoints for pipeline management, artifact tracking, and metadata storage
Dashboard: Web-based UI for monitoring pipelines and managing resources
Authentication: Supports multiple auth schemes (NO_AUTH, HTTP_BASIC, OAUTH2_PASSWORD_BEARER, EXTERNAL)
Database Backend: Stores pipeline metadata, artifacts, and configuration (SQLite or MySQL-compatible databases)
Secrets Management: Secure storage for credentials and sensitive data

Database Layer

ZenML supports two database types: SQLite (Development)

Local file-based database
No external dependencies
Suitable for development and testing
Limited scalability and no concurrent access

MySQL/MariaDB (Production)

External database service
High availability and scalability
Connection pooling and SSL support
Backup and recovery capabilities

Artifact Store

Stores pipeline artifacts and outputs:

Local filesystem (development)
Cloud object storage (S3, GCS, Azure Blob)
Network-attached storage (NFS, EFS)

Orchestrator

Executes pipeline steps:

Local orchestrator (development)
Kubernetes (production)
Cloud-managed services (Vertex AI, SageMaker, AzureML)

Deployment Types

ZenML supports multiple deployment scenarios:

Local Development

Local Server

Quick setup for development and testing on your local machine.

Characteristics:

SQLite database
Local artifact storage
Single-user access
Quick iteration and debugging

Use Cases:

Pipeline development
Local testing
Proof of concepts
Learning ZenML

Docker Deployment

Docker Container

Containerized server deployment for consistent environments.

Characteristics:

Isolated environment
Reproducible setup
Easy to start/stop
Configurable resources

Use Cases:

Development teams
CI/CD integration
Local production testing
Multi-user development

Kubernetes Deployment

Kubernetes with Helm

Production-grade deployment with high availability and scalability.

Characteristics:

Horizontal scaling
High availability
Load balancing
Advanced networking
Resource management

Use Cases:

Production deployments
Enterprise environments
Multi-tenant setups
Cloud-native infrastructure

Cloud-Managed Services

Cloud Platforms

Deploy on managed cloud platforms for fully managed infrastructure.

Characteristics:

Fully managed infrastructure
Automatic scaling
Built-in monitoring
Managed databases and storage

Use Cases:

AWS (ECS, EKS, App Runner)
GCP (Cloud Run, GKE)
Azure (Container Apps, AKS)

Deployment Decision Matrix

Choose your deployment type based on these criteria:

Criteria	Local	Docker	Kubernetes	Cloud-Managed
Ease of Setup	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐	⭐⭐⭐
Scalability	⭐	⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐
High Availability	❌	❌	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐
Multi-User	⭐	⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐
Production Ready	❌	⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐
Cost	Free	Low	Medium	Medium-High
Maintenance	None	Low	Medium-High	Low

Authentication Schemes

ZenML server supports multiple authentication methods:

NO_AUTH

No authentication required. Suitable only for local development.

zenml connect --url http://localhost:8080 --no-auth

HTTP_BASIC

Simple username/password authentication.

zenml connect --url http://localhost:8080 \
  --username admin \
  --password securepass

OAUTH2_PASSWORD_BEARER

OAuth2-based authentication with JWT tokens (recommended for production).

zenml connect --url https://zenml.example.com

EXTERNAL

Integrate with external authentication providers (SSO, LDAP, etc.).

Environment Variables

Key environment variables for ZenML server configuration:

Server Configuration

# Enable server mode
ZENML_SERVER=true

# Set deployment type for telemetry
ZENML_SERVER_DEPLOYMENT_TYPE=kubernetes

# Authentication scheme
ZENML_SERVER_AUTH_SCHEME=OAUTH2_PASSWORD_BEARER

# JWT secret key (generate with: openssl rand -hex 32)
ZENML_AUTH_JWT_SECRET_KEY=your-secret-key

# Server URL
ZENML_SERVER_URL=https://zenml.example.com

Database Configuration

# MySQL database URL
ZENML_STORE_URL=mysql://user:password@host:3306/zenml

# Connection pool settings
ZENML_STORE_POOL_SIZE=20
ZENML_STORE_MAX_OVERFLOW=20

# SSL configuration
ZENML_STORE_SSL_CA=/path/to/ca.crt
ZENML_STORE_SSL_VERIFY_SERVER_CERT=true

Performance Settings

# Thread pool size for request handling
ZENML_SERVER_THREAD_POOL_SIZE=40

# Authentication thread pool size
ZENML_SERVER_AUTH_THREAD_POOL_SIZE=5

# Request timeout (seconds)
ZENML_SERVER_REQUEST_TIMEOUT=20

# Request cache timeout (seconds)
ZENML_SERVER_REQUEST_CACHE_TIMEOUT=300

Secrets Store Configuration

# Use SQL secrets store
ZENML_SECRETS_STORE_TYPE=sql
ZENML_SECRETS_STORE_ENCRYPTION_KEY=your-encryption-key

# Or use cloud secrets managers
ZENML_SECRETS_STORE_TYPE=aws
ZENML_SECRETS_STORE_AWS_REGION=us-east-1

Security Considerations

Network Security

Use HTTPS in production
Configure proper firewall rules
Implement network policies in Kubernetes
Use VPC/subnet isolation

Authentication

Always use authentication in production
Rotate JWT secret keys regularly
Use strong passwords
Enable external auth for SSO

Secrets Management

Encrypt secrets at rest
Use cloud secrets managers
Rotate credentials regularly
Audit secret access

Database Security

Use SSL for database connections
Verify server certificates
Restrict database access
Regular backups and encryption

High Availability Setup

For production deployments requiring high availability:

Multiple Replicas

# values.yaml for Helm chart
zenml:
  replicaCount: 3
  
autoscaling:
  enabled: true
  minReplicas: 2
  maxReplicas: 10
  targetCPUUtilizationPercentage: 80

External Database

Use managed database services:

AWS RDS (MySQL)
Google Cloud SQL (MySQL)
Azure Database for MySQL

Load Balancing

Distribute traffic across replicas:

Kubernetes Service (ClusterIP)
Ingress Controller (nginx, traefik)
Cloud Load Balancers

Health Checks

Configure readiness and liveness probes:

livenessProbe:
  httpGet:
    path: /health
    port: http
  initialDelaySeconds: 15
  periodSeconds: 15

readinessProbe:
  httpGet:
    path: /ready
    port: http
  initialDelaySeconds: 8
  periodSeconds: 15

Monitoring and Observability

Metrics

ZenML server exposes metrics for monitoring:

Request count and latency
Database connection pool usage
Authentication failures
Pipeline run statistics

Logging

Configure logging verbosity:

# Set log level
ZENML_LOGGING_VERBOSITY=INFO  # DEBUG, INFO, WARNING, ERROR

# Enable debug mode
ZENML_DEBUG=true

Health Endpoints

/health - Liveness check
/ready - Readiness check
/version - Server version information

Next Steps

Server Deployment

Learn about different server deployment options

Kubernetes Deployment

Deploy ZenML on Kubernetes with Helm

Docker Deployment

Run ZenML server in Docker containers

Configuration Guide

Customize your deployment configuration

Getting Started

Core Concepts

Guides

Stack Components

Integrations

Advanced

Deployment

​Deployment Architecture Overview

​Architecture Components

​ZenML Server

​Database Layer

​Artifact Store

​Orchestrator

​Deployment Types

​Local Development

Local Server

​Docker Deployment

Docker Container

​Kubernetes Deployment

Kubernetes with Helm

​Cloud-Managed Services

Cloud Platforms

​Deployment Decision Matrix

​Authentication Schemes

​NO_AUTH

​HTTP_BASIC

​OAUTH2_PASSWORD_BEARER

​EXTERNAL

​Environment Variables

​Server Configuration

​Database Configuration

​Performance Settings

​Secrets Store Configuration

​Security Considerations

Network Security

Authentication

Secrets Management

Database Security

​High Availability Setup

​Multiple Replicas

​External Database

​Load Balancing

​Health Checks

​Monitoring and Observability

​Metrics

​Logging

​Health Endpoints

​Next Steps

Server Deployment

Kubernetes Deployment

Docker Deployment

Configuration Guide

​References

Build docs developers (and LLMs) love

Deployment Architecture Overview

Architecture Components

ZenML Server

Database Layer

Artifact Store

Orchestrator

Deployment Types

Local Development

Docker Deployment

Kubernetes Deployment

Cloud-Managed Services

Deployment Decision Matrix

Authentication Schemes

NO_AUTH

HTTP_BASIC

OAUTH2_PASSWORD_BEARER

EXTERNAL

Environment Variables

Server Configuration

Database Configuration

Performance Settings

Secrets Store Configuration

Security Considerations

High Availability Setup

Multiple Replicas

External Database

Load Balancing

Health Checks

Monitoring and Observability

Metrics

Logging

Health Endpoints

Next Steps

References