Architecture

Asta is built as a personal control plane with a FastAPI backend and cross-platform Tauri desktop app. This page explains the core architecture, message flow, and component breakdown.

High-Level Architecture

Architecture Overview

Asta follows a client-server architecture with multiple channels (desktop app, Telegram) connecting to a unified backend that orchestrates AI providers, tools, and services.

Component Breakdown

FastAPI Backend

Python 3.12/3.13 backend handling:

REST API + WebSocket endpoints
JWT-based authentication
Message routing and context building
Tool execution and skill orchestration

Location: backend/app/

Desktop App

Tauri v2 (Rust + React/TypeScript) providing:

Cross-platform UI (macOS/Windows)
Dashboard, Chat, Files, Settings
Real-time streaming responses
Local system integration

Location: MACWinApp/asta-app/

AI Provider Layer

Unified interface to multiple AI providers:

Groq, Google Gemini, Claude, OpenAI
OpenRouter, Ollama (local)
Automatic fallback chain
Vision and reasoning support

Location: backend/app/providers/

Skills System

Two-tier skill system:

Built-in Python skills (time, weather, Spotify)
Workspace SKILL.md files (on-demand loading)
Host OS gating for platform-specific skills

Location: backend/app/skills/, workspace/skills/

Telegram Channel

Bot integration for mobile access:

Long polling for updates
Same message handler as desktop
Inline buttons for approvals
Media rendering support

Location: backend/app/channels/telegram_bot.py

Data Layer

Persistent storage:

SQLite for users, conversations, settings
Chroma vector store for RAG
Workspace files (markdown notes)
Per-user memory files

Location: backend/app/db.py, backend/app/rag/

Message Flow

Request Path

When a user sends a message through any channel (desktop app or Telegram), here’s the complete flow:

Authentication

The request passes through AuthMiddleware (backend/app/auth_middleware.py):

JWT validation in multi-user mode
Legacy Bearer token in single-user mode
Sets request.state.user_id and user_role

Context Building

build_context() in backend/app/context.py assembles:

Recent conversation history
Connected channels and ground-truth state
Workspace context (USER.md, SOUL.md, TOOLS.md)
Available skills list (workspace SKILL.md files)
Tool instructions (exec, files, reminders, cron)
Service context (time, weather, Spotify, RAG snippets)

Provider Selection

handler.py determines which AI provider to use:

User’s default provider setting
Provider runtime state (enabled/auto-disabled)
Available API keys
Fallback chain: Claude → Google → OpenRouter → Ollama

Tool Execution Loop

If the AI response includes tool calls:

Handler executes each tool (exec, files, reminders, read)
Appends tool results to message history
Re-calls the same provider for final response
Continues until no more tool calls

Response Delivery

Final response is:

Streamed to desktop app via SSE (Server-Sent Events)
Sent to Telegram as formatted message
Persisted to conversation history in SQLite
Includes reasoning blocks if thinking is enabled

Vision Flow

For messages with images or PDFs:

Native Vision Providers (Claude, Google, OpenAI): Receive images/PDFs directly as part of the message
Non-Vision Providers (Ollama, Groq): Use OpenRouter vision preprocessor fallback chain:
- google/gemma-3-27b-it:free → nvidia/nemotron-nano-12b-v2-vl:free → google/gemma-3-12b-it:free → openrouter/auto
PDF Handling: Claude receives raw PDFs as native document content blocks for full-fidelity reading

Key Backend Files

backend/app/main.py

FastAPI application entry point:

Lifespan context manager (startup/shutdown)
CORS and authentication middleware
Router registration
Database initialization
Telegram bot startup

backend/app/handler.py

Core message handler:

Context building orchestration
Provider selection and fallback
Tool call execution loop
Stream state machine
Thinking/reasoning extraction

backend/app/context.py

Context assembly:

Workspace context injection
Recent conversation history
Ground-truth state (location, reminders count)
Available skills prompt
Service-specific context sections

backend/app/providers/fallback.py

Provider fallback logic:

Fixed chain order resolution
Runtime state checking
API key validation
Auto-disable on billing/auth failures
Stream event lifecycle management

backend/app/auth_middleware.py

Authentication middleware:

Multi-user JWT validation
Single-user Bearer token fallback
Public path exemptions
User context injection

Desktop App Structure

Tauri Application

The desktop app is built with Tauri v2, combining Rust for native system access with React/TypeScript for the UI.

Frontend (React/TypeScript)

MACWinApp/asta-app/src/
├── App.tsx                  # Main app component
├── components/
│   ├── Chat/                # Chat view and message components
│   ├── Settings/            # Settings sheet with tabs
│   ├── Agents/              # Agent management
│   ├── Login/               # Authentication pages
│   └── Dashboard/           # Dashboard widgets
└── utils/                   # API client and helpers

Key Dependencies (from package.json):

@tauri-apps/api - Tauri JavaScript bindings
react-markdown - Markdown rendering for messages
react-syntax-highlighter - Code block syntax highlighting

Backend (Rust)

MACWinApp/asta-app/src-tauri/
├── src/
│   ├── main.rs              # Tauri commands and window setup
│   └── lib.rs               # Additional native functionality
├── Cargo.toml               # Rust dependencies
└── tauri.conf.json          # Tauri configuration

Features:

Global shortcut (Alt+Space) to show/hide window
Auto-start on system boot
Window management and system tray
HTTP client for backend communication

Data Model

Users & Auth
Conversations
Settings
Tasks & Reminders

users table:

id (UUID)
username (unique)
password_hash (bcrypt)
role (admin | user)
created_at

JWT tokens with 30-day expiry containing:

sub: user_id
username
role

conversations table:

id
user_id
channel (web | telegram | subagent)
created_at

messages table:

id
conversation_id
role (user | assistant)
content
provider_used
created_at

user_settings table:

user_id
mood
default_ai_provider
thinking_level (off | minimal | low | medium | high | xhigh)
reasoning_mode (off | on | stream)

provider_models, provider_runtime_state, skill_toggles, api_keys

cron_jobs table:

Recurring cron expressions
One-shot reminders (@at <ISO-UTC>)
Channel targets (web | telegram)

cron_job_runs table:

Execution history with timestamps
Success/failure status

Workspace Structure

The workspace directory (workspace/) contains user data and configuration:

workspace/
├── AGENTS.md                # Workspace instructions
├── SOUL.md                  # Asta's personality/tone
├── USER.md                  # Global user context (single-user)
├── TOOLS.md                 # Custom tool notes
├── users/
│   └── {user_id}/
│       └── USER.md          # Per-user memory (multi-user)
├── skills/
│   ├── notes/
│   │   └── SKILL.md         # Notes skill instructions
│   ├── apple-notes/
│   └── things-mac/
└── notes/                   # User's markdown notes

Workspace skills are loaded on-demand (OpenClaw-style): the AI sees a list of available skills in context, selects one relevant skill, and calls read(path) to load that skill’s SKILL.md instructions only when needed.

Security Considerations

Authentication

JWT-based auth in multi-user mode
Bcrypt password hashing
Role-based access control (admin/user)
Token expiry and refresh

Exec Tool

Allowlist-based command execution
Per-skill required binaries gating
Approval system for sensitive commands
Configurable security modes (deny/allow/full)

File Access

Restricted to ASTA_ALLOWED_PATHS
User must explicitly allow new paths
No access outside user’s home directory
Virtual root for safe knowledge access

API Keys

Stored encrypted in database
Never exposed in logs or responses
Configurable via Settings or .env
Per-provider runtime state tracking

Never commit API keys to version control. Use environment variables or the Settings UI to configure provider keys.

Next Steps

AI Providers

Learn about supported AI providers and the fallback chain

Skills System

Understand how built-in and workspace skills work

Multi-User Auth

Deep dive into authentication and access control

API Reference

Explore the REST API endpoints

Get Started

Core Concepts

Desktop App

Features

Configuration

Guides

Troubleshooting

Architecture

High-Level Architecture

Architecture Overview

Component Breakdown

FastAPI Backend

Desktop App

AI Provider Layer

Skills System

Telegram Channel

Data Layer

Message Flow

Request Path

Vision Flow

Key Backend Files

Desktop App Structure

Tauri Application

Frontend (React/TypeScript)

Backend (Rust)

Data Model

Workspace Structure

Security Considerations

Authentication

Exec Tool

File Access

API Keys

Next Steps

AI Providers

Skills System

Multi-User Auth

API Reference

Build docs developers (and LLMs) love

Get Started

Core Concepts

Desktop App

Features

Configuration

Guides

Troubleshooting

​High-Level Architecture

Architecture Overview

​Component Breakdown

FastAPI Backend

Desktop App

AI Provider Layer

Skills System

Telegram Channel

Data Layer

​Message Flow

​Request Path

​Vision Flow

​Key Backend Files

​Desktop App Structure

Tauri Application

​Frontend (React/TypeScript)

​Backend (Rust)

​Data Model

​Workspace Structure

​Security Considerations

Authentication

Exec Tool

File Access

API Keys

​Next Steps

AI Providers

Skills System

Multi-User Auth

API Reference

Build docs developers (and LLMs) love

High-Level Architecture

Component Breakdown

Message Flow

Request Path

Vision Flow

Key Backend Files

Desktop App Structure

Frontend (React/TypeScript)

Backend (Rust)

Data Model

Workspace Structure

Security Considerations

Next Steps