Deployment Guide

Overview

vLLora is designed to be self-hosted and can be deployed in various environments. This guide covers deployment options, configuration, and best practices for running vLLora in production.

Installation Methods

Homebrew (macOS/Linux)

The easiest way to install vLLora:

brew tap vllora/vllora
brew install vllora

From Source

For development or custom builds:

git clone https://github.com/vllora/vllora.git
cd vllora
cargo build --release

The binary will be available at target/release/vllora.

Starting vLLora

Basic Usage

Start vLLora with default settings:

vllora

This will start:

HTTP API Server: http://0.0.0.0:9090
UI Server: http://0.0.0.0:9091
OTEL Collector: http://[::]:4317
Distri Server: http://0.0.0.0:8081

Using Custom Configuration

Start with a custom config file:

vllora --config /path/to/config.yaml

The default config file location is config.yaml in the current directory.

Command Line Options

vLLora supports the following CLI arguments that override configuration file settings:

Host and Port Configuration

vllora --host 127.0.0.1 --port 8080

Options:

--host <ADDRESS> - Host address to bind to (e.g., 127.0.0.1 for local or 0.0.0.0 for all interfaces)
--port <PORT> - API server port (default: 9090)
--ui-port <UI_PORT> - UI server port (default: 9091)
--distri-port <DISTRI_PORT> - Distri server port (default: 8081)
--otel-port <OTEL_PORT> - OTLP metrics port (default: 4317)

CORS Configuration

vllora --cors-origins "http://localhost:3000,https://example.com"

Specify comma-separated list of allowed CORS origins.

UI Configuration

vllora --open-ui-on-startup false

Disable automatic browser opening on startup.

Production Deployment

Environment Variables

vLLora respects the following environment variables: Logging:

export RUST_LOG=info  # Log level: trace, debug, info, warn, error
export ANSI_OUTPUT=true  # Enable colored output

Database:

export HOME=/path/to/home  # vLLora stores data in ~/.vllora/

LangDB API:

export LANGDB_API_URL=https://langdb.ai/v1  # Custom LangDB API endpoint

File System Layout

vLLora stores data in the following locations:

~/.vllora/
└── vllora.db  # SQLite database for metadata, traces, and metrics

Running as a Service

systemd (Linux)

Create /etc/systemd/system/vllora.service:

[Unit]
Description=vLLora AI Gateway
After=network.target

[Service]
Type=simple
User=vllora
WorkingDirectory=/opt/vllora
ExecStart=/usr/local/bin/vllora --config /etc/vllora/config.yaml
Restart=always
RestartSec=10

# Environment
Environment="RUST_LOG=info"
Environment="HOME=/var/lib/vllora"

# Security
NoNewPrivileges=true
PrivateTmp=true
ProtectSystem=strict
ProtectHome=true
ReadWritePaths=/var/lib/vllora

[Install]
WantedBy=multi-user.target

Enable and start:

sudo systemctl daemon-reload
sudo systemctl enable vllora
sudo systemctl start vllora
sudo systemctl status vllora

Docker (Coming Soon)

Docker support is planned for future releases. For now, you can build from source and containerize manually.

Network Configuration

Reverse Proxy with nginx

upstream vllora_api {
    server 127.0.0.1:9090;
}

upstream vllora_ui {
    server 127.0.0.1:9091;
}

server {
    listen 80;
    server_name api.yourdomain.com;

    location / {
        proxy_pass http://vllora_api;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
        proxy_set_header X-Forwarded-Proto $scheme;
    }
}

server {
    listen 80;
    server_name ui.yourdomain.com;

    location / {
        proxy_pass http://vllora_ui;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
        proxy_set_header X-Forwarded-Proto $scheme;
        
        # WebSocket support for real-time features
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
        proxy_set_header Connection "upgrade";
    }
}

Firewall Configuration

Ensure the following ports are accessible:

9090 - HTTP API (required)
9091 - UI Server (optional, can be restricted)
4317 - OTLP Collector (internal, can be restricted)
8081 - Distri Server (optional, for AI agent features)

High Availability

Database Considerations

vLLora uses SQLite for local storage. For production deployments:

Regular Backups: Set up automated backups of ~/.vllora/vllora.db
Volume Mounting: If containerized, mount persistent volumes for /var/lib/vllora
Connection Pool: vLLora uses a connection pool size of 10 by default

Scaling Considerations

vLLora is designed for single-instance deployment. For high-traffic scenarios:

Use a load balancer with sticky sessions
Consider horizontal scaling (multiple instances with separate databases)
Monitor database size and implement cleanup policies for old traces

Health Checks

Monitor vLLora health using these endpoints:

# Check API availability
curl http://localhost:9090/v1/models

# Check UI availability  
curl http://localhost:9091

Troubleshooting

Port Already in Use

If ports are already in use:

vllora --port 8080 --ui-port 8081

Database Locked Errors

If you see database locked errors:

Ensure only one vLLora instance is running
Check file permissions on ~/.vllora/
Verify no other processes are accessing the database

Permission Denied

Ensure the user running vLLora has:

Write permissions to $HOME/.vllora/
Permission to bind to the configured ports (use ports >1024 for non-root)

Next Steps

Configuration

Learn about advanced configuration options

API Keys

Configure provider API keys

Monitoring

Set up monitoring and observability

API Reference

Explore the REST API

Get Started

Core Concepts

Features

Guides

Deployment Guide

Overview

Installation Methods

Homebrew (macOS/Linux)

From Source

Starting vLLora

Basic Usage

Using Custom Configuration

Command Line Options

Host and Port Configuration

CORS Configuration

UI Configuration

Production Deployment

Environment Variables

File System Layout

Running as a Service

systemd (Linux)

Docker (Coming Soon)

Network Configuration

Reverse Proxy with nginx

Firewall Configuration

High Availability

Database Considerations

Scaling Considerations

Health Checks

Troubleshooting

Port Already in Use

Database Locked Errors

Permission Denied

Next Steps

Configuration

API Keys

Monitoring

API Reference

Build docs developers (and LLMs) love

Get Started

Core Concepts

Features

Guides

​Overview

​Installation Methods

​Homebrew (macOS/Linux)

​From Source

​Starting vLLora

​Basic Usage

​Using Custom Configuration

​Command Line Options

​Host and Port Configuration

​CORS Configuration

​UI Configuration

​Production Deployment

​Environment Variables

​File System Layout

​Running as a Service

​systemd (Linux)

​Docker (Coming Soon)

​Network Configuration

​Reverse Proxy with nginx

​Firewall Configuration

​High Availability

​Database Considerations

​Scaling Considerations

​Health Checks

​Troubleshooting

​Port Already in Use

​Database Locked Errors

​Permission Denied

​Next Steps

Configuration

API Keys

Monitoring

API Reference

Build docs developers (and LLMs) love

Overview

Installation Methods

Homebrew (macOS/Linux)

From Source

Starting vLLora

Basic Usage

Using Custom Configuration

Command Line Options

Host and Port Configuration

CORS Configuration

UI Configuration

Production Deployment

Environment Variables

File System Layout

Running as a Service

systemd (Linux)

Docker (Coming Soon)

Network Configuration

Reverse Proxy with nginx

Firewall Configuration

High Availability

Database Considerations

Scaling Considerations

Health Checks

Troubleshooting

Port Already in Use

Database Locked Errors

Permission Denied

Next Steps