
Setting Up Ollama for Local AI

Ollama provides local AI inference without cloud dependencies. Asta uses Ollama for:
  • RAG embeddings with nomic-embed-text
  • Local chat with models like llama3, mistral, or qwen
  • Privacy-focused workflows that keep data on your machine

1. Install Ollama

Download and install Ollama from ollama.com.

macOS/Linux:
curl -fsSL https://ollama.com/install.sh | sh

Windows: Download the installer from ollama.com/download.

Verify the installation:
ollama --version

2. Pull the RAG embedding model

Asta’s Learning skill requires nomic-embed-text for semantic search:
ollama pull nomic-embed-text
This model is lightweight (~274 MB) and optimized for embeddings.
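Once the server is running (next step), you can sanity-check the embedding model directly against Ollama's REST API; the prompt text here is arbitrary:

```shell
# Request an embedding for a test sentence from the local Ollama server.
# A successful response is a JSON object with an "embedding" array of floats.
curl -s http://localhost:11434/api/embeddings \
  -d '{"model": "nomic-embed-text", "prompt": "semantic search test"}'
```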

3. Start Ollama server

Ollama runs as a background service. Start it with:
ollama serve
Or launch the Ollama app. The API runs at http://localhost:11434 by default.
On macOS, Ollama starts automatically when you pull a model or run ollama serve.

4. Pull a chat model (optional)

For local chat inference, pull a model:
# Lightweight (4GB)
ollama pull llama3.2:3b

# Balanced (7GB)
ollama pull mistral

# Advanced (40GB+)
ollama pull llama3.1:70b
Browse the full model catalog at ollama.com/library.
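A quick way to confirm chat inference works is a one-shot ollama run, or a direct call to the REST API with streaming disabled:

```shell
# One-shot prompt: prints the model's reply and exits
ollama run llama3.2:3b "Say hello in one short sentence."

# Equivalent REST call; "stream": false returns a single JSON response
curl -s http://localhost:11434/api/generate \
  -d '{"model": "llama3.2:3b", "prompt": "Say hello.", "stream": false}'
```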

5. Configure Asta to use Ollama

In Asta’s web panel or desktop app:
  1. Go to Settings → AI Providers
  2. Enable Ollama and set it as your default provider
  3. Select a chat model (e.g., llama3.2:3b)
  4. Save settings
Asta will automatically use Ollama for RAG if nomic-embed-text is available.

Quick Setup Script

Asta includes a setup script that installs Ollama (Linux/macOS) and pulls the RAG model:
cd ~/workspace/source
./scripts/setup_ollama_rag.sh -i
What it does:
  • Installs Ollama via curl | sh (Linux/macOS only)
  • Pulls nomic-embed-text
  • Verifies the installation
Source: ~/workspace/source/scripts/setup_ollama_rag.sh
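If you prefer to run the steps by hand, or want to see roughly what the script automates, a minimal sketch looks like this (the real setup_ollama_rag.sh may use different flags and output):

```shell
#!/usr/bin/env bash
set -euo pipefail

# Install Ollama if it is not already on PATH (Linux/macOS installer).
if ! command -v ollama >/dev/null 2>&1; then
  curl -fsSL https://ollama.com/install.sh | sh
fi

# Pull the embedding model required by Asta's Learning skill.
ollama pull nomic-embed-text

# Verify the model is available locally.
ollama list | grep -q nomic-embed-text && echo "nomic-embed-text ready"
```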

Configuration

Environment Variables

Set these in .env or Settings:
# Ollama API endpoint
OLLAMA_BASE_URL=http://localhost:11434

# Model for RAG embeddings
ASTAMISTRAL_OLLAMA_EMBEDDING_MODEL=nomic-embed-text

# Default chat model
OLLAMA_MODEL=llama3.2:3b
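If you keep these variables in a .env file, a wrapper script can export them before starting the backend. `set -a` marks every variable assigned while the file is sourced for export; this is a generic shell pattern, not Asta-specific:

```shell
# Export all variables defined in .env into the environment,
# so they are visible to processes started afterwards.
if [ -f .env ]; then
  set -a        # auto-export every assignment that follows
  . ./.env
  set +a        # stop auto-exporting
fi

echo "$OLLAMA_BASE_URL"                      # e.g. http://localhost:11434
echo "$ASTAMISTRAL_OLLAMA_EMBEDDING_MODEL"   # e.g. nomic-embed-text
```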

Testing RAG

Once configured, test the Learning skill:
Learn about Python asyncio for 2 minutes
Asta will:
  1. Research the topic via web search
  2. Generate embeddings with nomic-embed-text
  3. Store knowledge in ChromaDB for retrieval

Troubleshooting

ollama: command not found
After installation, restart your terminal or add Ollama to your PATH:
export PATH="$PATH:/usr/local/bin"
On macOS, Ollama installs to /usr/local/bin by default.

Connection refused
Ensure Ollama is running:
ollama serve
Or launch the Ollama app from your Applications folder.

Model not found
Verify the model is pulled:
ollama list
If it is missing, pull it:
ollama pull nomic-embed-text

RAG is not working
Check these steps:
  1. nomic-embed-text is pulled: ollama list
  2. Ollama is running: curl http://localhost:11434/api/tags
  3. The environment variable is set: echo $ASTAMISTRAL_OLLAMA_EMBEDDING_MODEL
Restart the Asta backend after configuration changes.
macOS permission dialogs: When Asta first uses Ollama, macOS may prompt for network access. Allow the backend process (Python) to connect to localhost:11434.
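The checks above can be rolled into a single diagnostic script. This is a convenience sketch, not something shipped with Asta; it uses only the standard ollama CLI and curl:

```shell
#!/usr/bin/env bash
# Quick health check for Asta's Ollama-based RAG setup.

if ! command -v ollama >/dev/null 2>&1; then
  echo "FAIL: ollama not on PATH (restart your terminal or extend PATH)"
  exit 1
fi

if ! curl -s --max-time 2 http://localhost:11434/api/tags >/dev/null; then
  echo "FAIL: no server on :11434 (run 'ollama serve' or launch the app)"
  exit 1
fi

if ! ollama list | grep -q nomic-embed-text; then
  echo "FAIL: nomic-embed-text missing (run 'ollama pull nomic-embed-text')"
  exit 1
fi

echo "OK: Ollama is ready for Asta RAG"
```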

Model Recommendations

Model             Size     Use Case
nomic-embed-text  274 MB   RAG embeddings (required)
llama3.2:3b       4 GB     Fast chat, low memory
mistral           7 GB     Balanced quality/speed
qwen2.5:7b        7 GB     Excellent for coding
llama3.1:70b      40 GB    Advanced reasoning (requires 64 GB+ RAM)
