Security Best Practices

Agent Mesh Enterprise provides multiple security layers to protect your deployment. This guide covers authentication, authorization, network security, credential management, and operational security best practices.

Security Architecture

Agent Mesh Enterprise implements defense-in-depth security:

Security Layers

Network Layer: TLS encryption, firewall rules, DDoS protection
Authentication Layer: OAuth2, token validation, session management
Authorization Layer: RBAC, scope enforcement, policy decisions
Transport Layer: Secure broker messaging, certificate validation
Data Layer: Credential management, encryption at rest, access control

Authentication Security

OAuth2 Configuration

Production Settings

Never use development mode in production:

# ❌ NEVER in production
frontend_use_authorization: false
authorization_service:
  type: "none"

# ✅ Production configuration
frontend_use_authorization: true
authorization_service:
  type: "default_rbac"
  role_to_scope_definitions_path: "config/auth/roles.yaml"
  user_to_role_assignments_path: "config/auth/users.yaml"

Disable Development Mode

Disable OAuth2 development mode:

# oauth2_config.yaml
enabled: true
development_mode: false  # ✅ Enforces HTTPS and strict validation

Setting development_mode: true allows:

HTTP connections (insecure)
Relaxed token scope validation
Insecure transport

Never enable development mode in production.

Token Security

SAM Access Tokens

Configure secure token settings:

app_config:
  sam_access_token:
    enabled: true
    ttl_seconds: 3600  # 1 hour
    clock_skew_tolerance: 300  # 5 minutes
    
  # Session timeout (should be <= token TTL)
  session:
    timeout: 3600
    secure_cookies: true
    httponly_cookies: true
    samesite: "strict"

Best Practices:

Token TTL: 1-4 hours (balances security against user experience)
Session timeout: Equal to or shorter than the token TTL
Clock skew: Allow for clock drift between distributed systems (300s recommended)
Secure cookies: Always true in production, so cookies are sent only over HTTPS
HttpOnly cookies: Keep cookies out of reach of JavaScript, which limits token theft through XSS
SameSite: strict or lax for CSRF protection
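
The guidelines above can be checked mechanically before a deployment. The following Python sketch assumes the app_config mapping has already been loaded from YAML into a dictionary with the keys shown earlier. The function name check_token_settings and the exact bounds are illustrative choices drawn from this list, not part of Agent Mesh Enterprise.

```python
from typing import Any


def check_token_settings(app_config: dict[str, Any]) -> list[str]:
    """Return human-readable problems; an empty list means none were found."""
    problems: list[str] = []
    token = app_config.get("sam_access_token", {})
    session = app_config.get("session", {})

    ttl = token.get("ttl_seconds", 0)
    if not 3600 <= ttl <= 4 * 3600:
        problems.append(f"ttl_seconds={ttl} is outside the 1-4 hour range")

    if session.get("timeout", 0) > ttl:
        problems.append("session timeout is longer than the token TTL")

    if not session.get("secure_cookies", False):
        problems.append("secure_cookies must be true in production")

    if not session.get("httponly_cookies", False):
        problems.append("httponly_cookies should be true")

    if str(session.get("samesite", "")).lower() not in {"strict", "lax"}:
        problems.append("samesite should be 'strict' or 'lax'")

    return problems


# Mirrors the configuration shown in this section.
example = {
    "sam_access_token": {"enabled": True, "ttl_seconds": 3600, "clock_skew_tolerance": 300},
    "session": {"timeout": 3600, "secure_cookies": True,
                "httponly_cookies": True, "samesite": "strict"},
}
print(check_token_settings(example))  # []
```

A check like this could run in a CI step so that an insecure cookie or timeout setting fails the build rather than reaching production.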

Token Refresh

Implement automatic token refresh:

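// Frontend token refresh logic
// Keep the access token in memory: unlike localStorage, it does not persist
// where any injected script could read it later.
let accessToken = null;

const refreshToken = async () => {
  let response;
  try {
    response = await fetch('/api/v1/auth/refresh', {
      method: 'POST',
      credentials: 'include'  // Send cookies with the request
    });
  } catch (error) {
    // Network failure: keep the current token and retry on the next interval
    console.error('Token refresh request failed', error);
    return;
  }

  if (response.ok) {
    const { access_token } = await response.json();
    accessToken = access_token;
  } else {
    // Redirect to login
    window.location.href = '/api/v1/auth/login';
  }
};

// Refresh before expiration
setInterval(refreshToken, 3300000);  // 55 minutes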
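
The 55-minute interval is the 1-hour token TTL minus a 5-minute safety margin. If the TTL changes, the interval should change with it. A minimal Python sketch of that arithmetic follows; the function name and the default margin are assumptions made for illustration.

```python
def refresh_delay_ms(ttl_seconds: int, margin_seconds: int = 300) -> int:
    """Milliseconds to wait before refreshing a token that lives for ttl_seconds."""
    if margin_seconds >= ttl_seconds:
        raise ValueError("margin must be shorter than the token TTL")
    return (ttl_seconds - margin_seconds) * 1000


print(refresh_delay_ms(3600))  # 3300000, i.e. 55 minutes
```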

Multi-Factor Authentication

Enforce MFA at the identity provider level.

Azure AD:

Azure Portal → Microsoft Entra ID → Security → Conditional Access
→ Create policy requiring MFA for all users

Okta:

Okta Admin → Security → Multifactor → Add Factor
→ Create policy requiring MFA

Agent Mesh Enterprise inherits MFA enforcement from your IdP.

Authorization Security

RBAC Best Practices

Principle of Least Privilege

Grant minimum permissions required:

# ❌ Too permissive
roles:
  analyst:
    scopes:
      - "*"  # Full access

# ✅ Specific permissions
roles:
  analyst:
    description: "Data analyst with limited access"
    scopes:
      - "tool:data:read"           # Read data tools
      - "tool:artifact:load"        # Load artifacts
      - "tool:artifact:create"      # Create artifacts
      - "agent:analytics:delegate"  # Specific agent only

Wildcard Usage

Minimize wildcard scopes:

# ❌ Overly broad
scopes:
  - "tool:*:*"  # All tools, all actions

# ✅ Specific wildcards
scopes:
  - "tool:data:*"              # All data tool actions
  - "agent:customer_*:delegate" # Customer service agents only

Wildcards are acceptable for:

Admin roles (documented and audited)
Logical groupings (e.g., tool:data:* for data analysts)
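
To see why broad wildcards are risky, it helps to model how a wildcard scope might be matched against a required scope. The Python sketch below is an assumption about the matching rules (segment-by-segment glob matching on ":" boundaries, with a bare "*" covering everything). The matching logic Agent Mesh Enterprise actually uses is not described in this guide and may differ.

```python
from fnmatch import fnmatchcase


def scope_matches(granted: str, required: str) -> bool:
    """Return True if the granted scope pattern covers the required scope."""
    if granted == "*":
        return True  # a bare wildcard covers every scope
    granted_parts = granted.split(":")
    required_parts = required.split(":")
    if len(granted_parts) != len(required_parts):
        return False
    # Compare segment by segment so "*" never crosses a ":" boundary.
    return all(fnmatchcase(r, g) for g, r in zip(granted_parts, required_parts))


def is_allowed(granted_scopes: list[str], required: str) -> bool:
    """Return True if any granted scope covers the required scope."""
    return any(scope_matches(g, required) for g in granted_scopes)


analyst = ["tool:data:*", "agent:customer_*:delegate"]
print(is_allowed(analyst, "tool:data:read"))                   # True
print(is_allowed(analyst, "agent:customer_support:delegate"))  # True
print(is_allowed(analyst, "tool:artifact:delete"))             # False
print(is_allowed(["tool:*:*"], "tool:artifact:delete"))        # True
```

The last line shows the cost of "tool:*:*": it silently grants delete access to every tool, including tools added later.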

Role Separation

Separate read/write permissions:

roles:
  data_viewer:
    description: "Read-only data access"
    scopes:
      - "tool:data:read"
      - "tool:artifact:load"
      - "monitor/namespace/*:a2a_messages:subscribe"
  
  data_operator:
    description: "Data operations"
    inherits: ["data_viewer"]
    scopes:
      - "tool:data:write"
      - "tool:data:execute"
      - "tool:artifact:create"
  
  data_admin:
    description: "Full data management"
    inherits: ["data_operator"]
    scopes:
      - "tool:data:delete"
      - "tool:data:admin"
      - "tool:artifact:delete"
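
When reviewing roles that use inherits, it is useful to expand each role into its full set of scopes. The following Python sketch resolves inheritance recursively and rejects cycles. It assumes the roles file has been loaded into a dictionary shaped like the YAML above; the function name effective_scopes is hypothetical.

```python
def effective_scopes(roles: dict[str, dict], role_name: str,
                     _seen: frozenset[str] = frozenset()) -> set[str]:
    """Return the role's own scopes plus everything inherited from its parents."""
    if role_name in _seen:
        raise ValueError(f"inheritance cycle at role {role_name!r}")
    role = roles[role_name]
    scopes = set(role.get("scopes", []))
    for parent in role.get("inherits", []):
        scopes |= effective_scopes(roles, parent, _seen | {role_name})
    return scopes


# Same roles as the YAML example, descriptions omitted.
ROLES = {
    "data_viewer": {"scopes": ["tool:data:read", "tool:artifact:load",
                               "monitor/namespace/*:a2a_messages:subscribe"]},
    "data_operator": {"inherits": ["data_viewer"],
                      "scopes": ["tool:data:write", "tool:data:execute",
                                 "tool:artifact:create"]},
    "data_admin": {"inherits": ["data_operator"],
                   "scopes": ["tool:data:delete", "tool:data:admin",
                              "tool:artifact:delete"]},
}

print(len(effective_scopes(ROLES, "data_admin")))                      # 9
print("tool:data:delete" in effective_scopes(ROLES, "data_operator"))  # False
```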

Custom Tool Security

Enforce fine-grained access on custom tools:

# Tool definition
components:
  - component_name: production_database_query
    component_module: custom_tools
    component_config:
      tool_name: "prod_db_query"
      required_scopes:
        - "database:production:read"  # Specific scope
      database:
        host: "prod-db.internal"
        read_only: true

# Role assignment
roles:
  senior_analyst:
    scopes:
      - "database:production:read"  # Grants access
  
  junior_analyst:
    scopes:
      - "database:staging:read"     # No production access

Agent Access Control

Restrict agent access per role:

roles:
  customer_support:
    scopes:
      - "agent:customer_support:delegate"  # Support agent only
      - "agent:knowledge_base:delegate"    # KB agent only
      # No access to admin or sensitive agents
  
  system_admin:
    scopes:
      - "agent:*:delegate"  # All agents

Network Security

TLS/SSL Configuration

Gateway HTTPS

Always use HTTPS in production:

# webui.yaml
app_config:
  ssl_certfile: "/app/certs/fullchain.pem"
  ssl_keyfile: "/app/certs/privkey.pem"
  ssl_ca_certs: "/app/certs/ca-bundle.pem"  # Optional: client cert validation

OAuth2 Service HTTPS

# oauth2_server.yaml
shared_config:
  - oauth2_config: &oauth2_config
      ssl_cert: "/app/certs/oauth2-cert.pem"
      ssl_key: "/app/certs/oauth2-key.pem"

Broker TLS

Secure broker connections:

broker:
  url: "tcps://broker.example.com:55443"  # TLS port
  vpn: "enterprise_vpn"
  username: "${BROKER_USERNAME}"
  password: "${BROKER_PASSWORD}"
  
  # Certificate validation
  ssl:
    verify_mode: "CERT_REQUIRED"
    ca_certs: "/app/certs/broker-ca.pem"
    certfile: "/app/certs/client-cert.pem"
    keyfile: "/app/certs/client-key.pem"
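
For readers unfamiliar with what CERT_REQUIRED implies, the sketch below builds an equivalent client-side TLS context with Python's standard ssl module. It is an illustration only: it is not the broker client that Agent Mesh Enterprise uses, and the function name build_client_context is hypothetical.

```python
import ssl
from typing import Optional


def build_client_context(ca_file: Optional[str] = None,
                         cert_file: Optional[str] = None,
                         key_file: Optional[str] = None) -> ssl.SSLContext:
    """Create a TLS client context that verifies the server certificate."""
    # create_default_context enables certificate and hostname verification.
    context = ssl.create_default_context(ssl.Purpose.SERVER_AUTH, cafile=ca_file)
    context.minimum_version = ssl.TLSVersion.TLSv1_2
    if cert_file and key_file:
        # Present a client certificate for mutual TLS.
        context.load_cert_chain(certfile=cert_file, keyfile=key_file)
    return context


# In a real deployment, pass the /app/certs/... paths from the broker config.
context = build_client_context()
print(context.verify_mode == ssl.CERT_REQUIRED, context.check_hostname)  # True True
```

With this mode, a connection to a broker whose certificate does not chain to a trusted CA fails during the handshake instead of proceeding unverified.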

Certificate Management

Let’s Encrypt Automation

#!/bin/bash
# renew-certs.sh - Automated certificate renewal
set -euo pipefail  # Stop at the first failed step

# Renew certificates
certbot renew --quiet

# Copy to application directory
cp /etc/letsencrypt/live/yourdomain.com/fullchain.pem /app/certs/
cp /etc/letsencrypt/live/yourdomain.com/privkey.pem /app/certs/

# Set permissions
chown sam-app:sam-app /app/certs/*.pem
chmod 600 /app/certs/*.pem

# Reload gateway (graceful restart)
docker exec sam-enterprise kill -HUP 1

Schedule with cron:

# Renew certificates daily at 2 AM
0 2 * * * /usr/local/bin/renew-certs.sh

Certificate Validation

Verify certificates before deployment:

# Check certificate expiration
openssl x509 -in /app/certs/cert.pem -noout -dates

# Verify certificate chain
openssl verify -CAfile /app/certs/ca-bundle.pem /app/certs/cert.pem

# Check certificate matches key (compares public keys, so it works for RSA and ECDSA)
openssl x509 -in /app/certs/cert.pem -noout -pubkey | openssl sha256
openssl pkey -in /app/certs/key.pem -pubout | openssl sha256
# Digests should match
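
The expiration check can also be turned into a number that a monitoring job can alert on. The Python sketch below parses the text printed by openssl x509 -noout -dates and returns the days remaining. The function name and the sample dates are illustrative assumptions.

```python
import ssl
from datetime import datetime, timezone


def days_until_expiry(openssl_dates_output: str, now: datetime) -> int:
    """Parse `openssl x509 -noout -dates` output; return whole days until notAfter."""
    for line in openssl_dates_output.splitlines():
        if line.startswith("notAfter="):
            not_after = line.split("=", 1)[1].strip()
            # cert_time_to_seconds returns seconds since the epoch (UTC).
            expires = ssl.cert_time_to_seconds(not_after)
            return int((expires - now.timestamp()) // 86400)
    raise ValueError("no notAfter line found")


sample = "notBefore=Jan  1 00:00:00 2030 GMT\nnotAfter=Apr  1 00:00:00 2030 GMT"
now = datetime(2030, 3, 2, tzinfo=timezone.utc)
print(days_until_expiry(sample, now))  # 30
```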

Firewall Configuration

Inbound Rules

# Allow HTTPS traffic
sudo ufw allow 443/tcp comment 'WebUI Gateway'

# Allow OAuth2 service (if externally accessible)
sudo ufw allow 8080/tcp comment 'OAuth2 Service'

# Allow Platform Service API
sudo ufw allow 8001/tcp comment 'Platform Service'

# Deny all other inbound
sudo ufw default deny incoming

Outbound Rules

# Allow outbound HTTPS
sudo ufw allow out 443/tcp comment 'HTTPS'

# Allow broker connection
sudo ufw allow out 55443/tcp comment 'Solace Broker TLS'

# Allow DNS
sudo ufw allow out 53 comment 'DNS'

CORS Configuration

Restrict Cross-Origin Resource Sharing:

# Production: Specific origins only
app_config:
  cors_allowed_origins:
    - "https://yourdomain.com"
    - "https://app.yourdomain.com"

# ❌ NEVER in production
cors_allowed_origins:
  - "*"  # Allows any origin
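
The difference between an allowlist and "*" is easiest to see in code. The following Python sketch performs an exact match on scheme and host, so look-alike domains are rejected. How Agent Mesh Enterprise evaluates cors_allowed_origins internally is not described here; the function below is only an illustration of the intended behavior.

```python
from urllib.parse import urlsplit

ALLOWED_ORIGINS = frozenset({"https://yourdomain.com", "https://app.yourdomain.com"})


def is_origin_allowed(origin: str, allowed: frozenset[str] = ALLOWED_ORIGINS) -> bool:
    """Exact match on scheme, host, and port; no wildcard or suffix matching."""
    parts = urlsplit(origin)
    if parts.scheme != "https" or not parts.netloc:
        return False
    normalized = f"{parts.scheme}://{parts.netloc.lower()}"
    return normalized in allowed


print(is_origin_allowed("https://app.yourdomain.com"))       # True
print(is_origin_allowed("https://yourdomain.com.evil.com"))  # False
print(is_origin_allowed("http://yourdomain.com"))            # False
```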

Credential Management

Environment Variables

Store secrets as environment variables:

# ✅ Good: Environment variables
export AZURE_CLIENT_SECRET="$(cat /run/secrets/azure_secret)"
export BROKER_PASSWORD="$(cat /run/secrets/broker_password)"

# ❌ Bad: Hardcoded in files
client_secret: "abc123..."  # Never do this

Docker Secrets

Use Docker secrets for sensitive data:

# Create secrets (printf avoids the trailing newline that echo would add)
printf '%s' "azure-client-secret-value" | docker secret create azure_secret -
printf '%s' "broker-password-value" | docker secret create broker_password -

# Use in Docker Compose
docker-compose.yml:

services:
  sam-enterprise:
    image: solace-agent-mesh-enterprise:latest
    secrets:
      - azure_secret
      - broker_password
    environment:
      - AZURE_CLIENT_SECRET_FILE=/run/secrets/azure_secret
      - BROKER_PASSWORD_FILE=/run/secrets/broker_password

secrets:
  azure_secret:
    external: true
  broker_password:
    external: true
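
The *_FILE variables above point at files under /run/secrets rather than carrying the secret itself. Whether the Agent Mesh Enterprise image resolves these variables on its own is not stated in this guide. The Python sketch below shows one common way an application or entrypoint could resolve them; the helper name read_secret is hypothetical.

```python
import os
import tempfile
from typing import Mapping, Optional


def read_secret(name: str, env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Prefer NAME_FILE (a path to the secret) and fall back to NAME."""
    file_path = env.get(f"{name}_FILE")
    if file_path:
        with open(file_path, encoding="utf-8") as handle:
            return handle.read().rstrip("\n")  # tolerate a trailing newline
    return env.get(name)


# Demonstration with a temporary file standing in for /run/secrets/azure_secret.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "azure_secret")
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("example-value\n")
    demo_env = {"AZURE_CLIENT_SECRET_FILE": path}
    loaded = read_secret("AZURE_CLIENT_SECRET", demo_env)

print(loaded)  # example-value
```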

Kubernetes Secrets

For Kubernetes deployments:

apiVersion: v1
kind: Secret
metadata:
  name: sam-credentials
type: Opaque
data:
  azure-client-secret: <base64-encoded-value>
  broker-password: <base64-encoded-value>
---
apiVersion: v1
kind: Pod
metadata:
  name: sam-enterprise
spec:
  containers:
  - name: sam
    image: solace-agent-mesh-enterprise:latest
    env:
    - name: AZURE_CLIENT_SECRET
      valueFrom:
        secretKeyRef:
          name: sam-credentials
          key: azure-client-secret
    - name: BROKER_PASSWORD
      valueFrom:
        secretKeyRef:
          name: sam-credentials
          key: broker-password
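
The data fields of the Secret manifest above expect base64-encoded values. Base64 is an encoding, not encryption, so anyone who can read the manifest can recover the secret. A short Python sketch of the encoding step follows, using a made-up value.

```python
import base64


def encode_secret_value(value: str) -> str:
    """Encode a secret for the `data` section of a Kubernetes Secret."""
    return base64.b64encode(value.encode("utf-8")).decode("ascii")


encoded = encode_secret_value("s3cr3t")
print(encoded)                                    # czNjcjN0
print(base64.b64decode(encoded).decode("utf-8"))  # s3cr3t
```

Because the value is trivially reversible, keep manifests containing real secrets out of version control.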

Secret Rotation

Implement regular credential rotation:

#!/bin/bash
# rotate-secrets.sh
set -euo pipefail  # Stop at the first failed step

# Generate new OAuth2 client secret in Azure
NEW_SECRET=$(az ad app credential reset \
  --id "$AZURE_APP_ID" \
  --append \
  --query password -o tsv)

# Create a new Docker secret (secrets are immutable, so rotation needs a new name)
printf '%s' "$NEW_SECRET" | docker secret create azure_secret_v2 -

# Update service to use new secret
docker service update \
  --secret-rm azure_secret \
  --secret-add source=azure_secret_v2,target=azure_secret \
  sam-enterprise

# After validation, remove old secret
# (Wait 24 hours for verification)
# docker secret rm azure_secret

Rotation schedule:

OAuth2 secrets: Every 90 days
Database passwords: Every 90 days
API keys: Every 180 days
SSL certificates: Automated (Let’s Encrypt)
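
A schedule like this is easier to follow when something reports which credentials are overdue. The Python sketch below applies the intervals listed above to a record of last-rotation dates. The credential names, the dates, and the function name are assumptions for illustration.

```python
from datetime import date, timedelta

# Maximum age in days, taken from the rotation schedule above.
ROTATION_DAYS = {"oauth2_secret": 90, "database_password": 90, "api_key": 180}


def credentials_due(last_rotated: dict[str, date], today: date) -> list[str]:
    """Return the credential kinds whose age has reached the rotation interval."""
    due = []
    for kind, rotated_on in last_rotated.items():
        if today - rotated_on >= timedelta(days=ROTATION_DAYS[kind]):
            due.append(kind)
    return sorted(due)


last_rotated = {
    "oauth2_secret": date(2025, 1, 15),      # 137 days old on the check date
    "database_password": date(2025, 4, 1),   # 61 days old
    "api_key": date(2025, 1, 1),             # 151 days old
}
print(credentials_due(last_rotated, today=date(2025, 6, 1)))  # ['oauth2_secret']
```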

Connector Security

Shared Credential Model

Understand connector security implications:

# All agents assigned to this connector share credentials
# Security boundaries exist at external system level

connector:
  name: "production_database"
  type: "sql"
  credentials:
    username: "app_reader"  # Limited permissions
    password: "${DB_PASSWORD}"
  
  # Database-level security
  database:
    grants:
      - "SELECT ON analytics.*"  # Read-only
      # No INSERT, UPDATE, DELETE

Principle of Least Privilege

Configure minimal database permissions:

-- Create read-only user for connector
CREATE USER 'sam_readonly'@'%' IDENTIFIED BY 'strong-password';

-- Start from no global privileges, then grant only what is needed
REVOKE ALL PRIVILEGES ON *.* FROM 'sam_readonly'@'%';

-- Grant SELECT only on specific schemas
GRANT SELECT ON analytics.* TO 'sam_readonly'@'%';
GRANT SELECT ON reporting.* TO 'sam_readonly'@'%';

-- Verify the result
SHOW GRANTS FOR 'sam_readonly'@'%';

-- No admin privileges
-- No INSERT, UPDATE, DELETE
-- No CREATE, DROP, ALTER

API Key Scoping

Use scoped API keys for OpenAPI connectors:

# OpenAPI connector with minimal permissions
connector:
  name: "external_api"
  type: "openapi"
  authentication:
    type: "api_key"
    api_key: "${API_KEY}"  # Scoped to read-only operations

Configure API key at provider:

Stripe Dashboard → Developers → API Keys
→ Create restricted key
→ Scope: Read-only
→ Resources: Customers, Invoices (read only)

Operational Security

Audit Logging

Enable comprehensive audit logging:

log:
  stdout_log_level: INFO
  log_file_level: DEBUG
  log_file: /app/logs/audit.log
  
  # Log all authorization decisions
  log_auth_decisions: true
  
  # Log all token operations
  log_token_operations: true
  
  # Log configuration changes
  log_config_changes: true

Log forwarding to SIEM:

# Filebeat configuration for ELK Stack
filebeat.inputs:
- type: log
  paths:
    - /app/logs/audit.log
  fields:
    application: sam-enterprise
    environment: production

output.elasticsearch:
  hosts: ["https://elk.example.com:9200"]
  username: "filebeat"
  password: "${FILEBEAT_PASSWORD}"

Security Monitoring

Monitor for security events:

# Prometheus metrics
metrics:
  enabled: true
  port: 9090
  
  # Security metrics
  track:
    - authentication_failures
    - authorization_denials
    - token_validation_errors
    - rate_limit_hits
    - invalid_credentials

Alert on suspicious activity:

# Prometheus alert rules
groups:
- name: security
  rules:
  - alert: HighAuthenticationFailures
    expr: rate(auth_failures_total[5m]) > 10
    annotations:
      summary: "High authentication failure rate"
  
  - alert: AuthorizationDenials
    expr: rate(authz_denials_total[5m]) > 5
    annotations:
      summary: "Unusual authorization denial rate"

Rate Limiting

Implement rate limiting to prevent abuse:

# OAuth2 service rate limiting
security:
  rate_limit:
    enabled: true
    requests_per_minute: 60
    burst: 10
    
    # Per-user limits
    per_user_limits:
      enabled: true
      requests_per_minute: 30
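
One common way to interpret requests_per_minute together with burst is a token bucket: the bucket holds up to burst tokens and refills at the per-minute rate. The Python sketch below illustrates that model with an injected clock so the behavior is deterministic. It is an assumption about the semantics, not a description of how the OAuth2 service implements its limiter.

```python
class TokenBucket:
    """Token-bucket limiter: refill at a steady rate, spend one token per request."""

    def __init__(self, requests_per_minute: int, burst: int, now: float = 0.0) -> None:
        self.rate = requests_per_minute / 60.0  # tokens added per second
        self.capacity = float(burst)
        self.tokens = float(burst)
        self.updated = now

    def allow(self, now: float) -> bool:
        """Return True if a request arriving at time `now` (seconds) is admitted."""
        elapsed = max(0.0, now - self.updated)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


bucket = TokenBucket(requests_per_minute=60, burst=10)
burst_results = [bucket.allow(now=0.0) for _ in range(12)]
print(burst_results.count(True))  # 10: the burst is spent, the rest are rejected
print(bucket.allow(now=1.0))      # True: one token refilled after one second
```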

DDoS Protection

Implement DDoS mitigation:

# Nginx reverse proxy with rate limiting
http {
  limit_req_zone $binary_remote_addr zone=auth:10m rate=10r/s;
  limit_req_zone $binary_remote_addr zone=api:10m rate=100r/s;
  
  server {
    location /api/v1/auth/ {
      limit_req zone=auth burst=5 nodelay;
      proxy_pass http://localhost:8000;
    }
    
    location /api/ {
      limit_req zone=api burst=20 nodelay;
      proxy_pass http://localhost:8000;
    }
  }
}

Compliance

Data Retention

Implement data retention policies:

data_retention:
  # Chat history
  conversations:
    retention_days: 90
    archive_after_days: 30
  
  # Audit logs
  audit_logs:
    retention_days: 365
    archive_after_days: 90
  
  # Artifacts
  artifacts:
    retention_days: 180
    auto_delete: true
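
The policy above implies three states for each record: keep, archive, or delete. The Python sketch below makes that decision from a record's age. It assumes the thresholds are inclusive and that artifacts, which have no archive_after_days, go straight from keep to delete; the function name is hypothetical.

```python
from datetime import date

# Same values as the data_retention configuration above.
POLICY = {
    "conversations": {"retention_days": 90, "archive_after_days": 30},
    "audit_logs": {"retention_days": 365, "archive_after_days": 90},
    "artifacts": {"retention_days": 180},
}


def retention_action(kind: str, created: date, today: date) -> str:
    """Return 'keep', 'archive', or 'delete' for a record of the given kind."""
    rule = POLICY[kind]
    age = (today - created).days
    if age >= rule["retention_days"]:
        return "delete"
    if "archive_after_days" in rule and age >= rule["archive_after_days"]:
        return "archive"
    return "keep"


today = date(2025, 6, 1)
print(retention_action("conversations", date(2025, 4, 17), today))  # archive (45 days old)
print(retention_action("artifacts", date(2025, 2, 21), today))      # keep (100 days old)
```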

Encryption at Rest

Encrypt sensitive data:

# Database connections
# PostgreSQL TLS settings (these protect data in transit;
# data at rest is covered by the disk encryption below)
postgresql.conf:
  ssl = on
  ssl_cert_file = '/path/to/server.crt'
  ssl_key_file = '/path/to/server.key'

# Disk encryption
# LUKS for Linux volumes (luksFormat erases all existing data on the device)
sudo cryptsetup luksFormat /dev/sdb
sudo cryptsetup open /dev/sdb sam-data
sudo mkfs.ext4 /dev/mapper/sam-data

Privacy Controls

Implement privacy protections:

privacy:
  # PII redaction
  redact_pii: true
  pii_patterns:
    - email
    - phone
    - ssn
    - credit_card
  
  # Data anonymization
  anonymize_logs: true
  
  # Right to be forgotten
  enable_user_deletion: true
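
Pattern-based redaction, as implied by pii_patterns, typically replaces each match with a marker before the text is logged. The Python sketch below covers two of the listed patterns, email and ssn, with deliberately simple regular expressions. The patterns and the marker format are assumptions; the product's built-in patterns are not documented in this guide.

```python
import re

# Simplified patterns for illustration; real-world PII detection needs more care.
PII_PATTERNS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def redact_pii(text: str) -> str:
    """Replace every match of each pattern with a labeled marker."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[REDACTED:{label}]", text)
    return text


print(redact_pii("Contact jane.doe@example.com, SSN 123-45-6789."))
# Contact [REDACTED:email], SSN [REDACTED:ssn].
```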

Security Checklist

Pre-Production

Post-Deployment

Regular security audits scheduled
Credential rotation implemented
Certificate renewal automated
Backup procedures tested
Incident response plan documented
Security patches applied promptly
Access reviews conducted quarterly
Penetration testing performed annually

Incident Response

Security Incident Procedure

Detect: Security monitoring alerts
Contain: Disable compromised credentials
Investigate: Review audit logs
Remediate: Rotate secrets, patch vulnerabilities
Document: Incident report
Learn: Update security procedures

Emergency Lockdown

Procedure for security breach:

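#!/bin/bash
# emergency-lockdown.sh

# 1. Disable all access (freeze every process in the container;
#    a STOP signal sent to PID 1 from inside the container is ignored)
docker pause sam-enterprise

# 2. Backup current state
docker commit sam-enterprise sam-forensics:$(date +%Y%m%d-%H%M%S)

# 3. Rotate all credentials
./rotate-all-secrets.sh

# 4. Enable emergency mode (deny all)
# The container must receive this variable through its own environment
# (for example its Compose or env file); a host-shell export alone does not reach it.
export SAM_AUTHORIZATION_CONFIG='{ "authorization_service": { "type": "deny_all" } }'

# 5. Restart with new config
docker restart sam-enterprise

# 6. Notify security team
# SECURITY_TEAM_EMAIL is a placeholder; set it to your security team's address
echo "SECURITY INCIDENT: SAM lockdown activated" | \
  mail -s "[URGENT] SAM Security Lockdown" "$SECURITY_TEAM_EMAIL"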
