This guide assumes you’ve already installed RCLI. If not, run:
Your First Voice Command
Let’s get RCLI running and execute your first voice command in three steps.
Launch Interactive Mode
Start the RCLI TUI (terminal user interface) by running rcli. You’ll see a dashboard with:
- Conversation history panel
- Live hardware metrics (CPU, GPU, RAM usage)
- Active models (LLM, STT, TTS)
- Keyboard shortcuts guide
Press and Hold SPACE to Record
Hold down the SPACE bar and speak a command:
“What’s the weather like today?”
Release SPACE when done. You’ll see:
- STT: Your speech transcribed to text (~40ms latency)
- LLM: Response generated token-by-token (~160 tok/s)
- TTS: Audio synthesis and playback (~150ms)
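As a rough feel for how these figures add up, the sketch below estimates total response time for a reply of a given length. It assumes the three stages run strictly one after another and that the quoted figures hold on your hardware; in practice stages may overlap, so treat the result as a ballpark. The function name estimate_latency_ms is illustrative and not part of RCLI.

```shell
# Ballpark end-to-end latency in milliseconds for a reply of N tokens.
# Assumes sequential stages and the approximate figures quoted above:
# STT ~40 ms, LLM ~160 tok/s, TTS ~150 ms.
estimate_latency_ms() {
  awk -v tokens="$1" 'BEGIN {
    stt_ms = 40
    llm_ms = tokens * 1000 / 160   # time to generate all tokens
    tts_ms = 150
    printf "%.0f\n", stt_ms + llm_ms + tts_ms
  }'
}

estimate_latency_ms 32   # a short, one-sentence reply
```

With these assumptions a 32-token reply comes to 390 ms, of which 200 ms is token generation.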
On first run, macOS will prompt for microphone permissions. Click OK to grant access. Without microphone access, voice commands won’t work.
Try macOS Actions
RCLI ships with 43 macOS actions for controlling your system by voice. Try these examples:
View All Actions
List all 43 available actions:
Text Commands (No Voice)
RCLI also accepts text input. Use rcli ask for one-shot commands without voice:
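A minimal sketch of a one-shot text query from a shell script follows. The quoted-prompt form of rcli ask mirrors the example in the Troubleshooting section; the wrapper name ask_rcli and its PATH check are illustrative additions, not part of RCLI.

```shell
# Send a one-shot text prompt to RCLI, failing clearly if it is not installed.
ask_rcli() {
  if ! command -v rcli >/dev/null 2>&1; then
    echo "rcli not found on PATH" >&2
    return 127
  fi
  # Pass all arguments through unchanged so the prompt stays one quoted string.
  rcli ask "$@"
}

# Usage (requires RCLI to be installed):
#   ask_rcli "what time is it?"
```

Keeping the prompt in double quotes prevents the shell from splitting it into separate arguments or expanding characters such as ? as a glob.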
Type in the TUI
In interactive mode (rcli), you can also type text directly. Just start typing (no need to press a key first):
- Launch rcli
- Type: what time is it?
- Press Enter
Continuous Voice Mode
For hands-free control, use listen mode (always-on voice detection):
“Hey RCLI, what’s the time?”
“Open Spotify and play some music”
“Create a reminder to buy milk”
Press ESC to stop listening mode.
Interactive TUI Controls
The RCLI TUI (rcli command) supports several keyboard shortcuts:
| Key | Action |
|---|---|
| SPACE | Start/stop push-to-talk recording |
| M | Models panel — browse, download, switch LLM/STT/TTS |
| A | Actions panel — view, enable/disable, test actions |
| B | Benchmarks panel — run STT, LLM, TTS, E2E benchmarks |
| R | RAG panel — ingest documents, clear index |
| D | Cleanup panel — delete unused models |
| T | Toggle tool call trace (see LLM tool calls inline) |
| ESC | Stop processing / close panel / quit |
Tool Call Trace
Press T to enable tool call tracing. When active, every LLM tool call is displayed inline:
Query Local Documents (RAG)
RCLI includes a hybrid RAG system (vector + BM25 search) for querying local documents.
Ingest Documents
Index a directory of files (PDF, DOCX, TXT):
RCLI processes each file, chunks the content, and builds a vector index (~4ms retrieval latency).
Query by Voice or Text
Ask questions about your documents:
Or in the TUI, press SPACE and say:
“What were the key decisions in the project plan?”
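This guide does not say how RCLI combines its vector and BM25 results. For intuition only, here is a sketch of reciprocal rank fusion (RRF), one common way to merge two ranked lists; it is not claimed to be RCLI's method. The function name rrf_fuse, the one-ID-per-line input format, and the constant k=60 are illustrative assumptions.

```shell
# rrf_fuse FILE_A FILE_B
# Each file lists document IDs one per line, best match first.
# Prints "score id" lines sorted by fused score, highest first.
rrf_fuse() {
  awk -v k=60 '
    FNR == 1 { rank = 0 }                      # restart the rank for each file
    { rank++; score[$1] += 1 / (k + rank) }    # higher rank, larger contribution
    END { for (id in score) printf "%.5f %s\n", score[id], id }
  ' "$1" "$2" | sort -k1,1nr -k2,2
}

# Usage:
#   rrf_fuse vector_ranking.txt bm25_ranking.txt
```

For example, if a vector search ranks documents a, b, c and a keyword search ranks b, d, a, document b comes out on top because it places well in both lists.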
Check RAG Index Status
Clear the Index
Model Management
RCLI supports 20+ models across LLM, STT, and TTS. Switch models without restarting:
Model Storage
All models are stored in ~/Library/RCLI/models/. Active model selection persists in ~/Library/RCLI/config.
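To see which downloads take the most room, a small sketch using standard du and sort is shown below. The helper name list_model_sizes is hypothetical, and how RCLI lays out files inside the models directory is not specified here, so the function simply sizes every top-level entry.

```shell
# Print the size in KiB of each entry in the models directory, largest last.
# Defaults to the storage path given above; pass another directory to override.
list_model_sizes() {
  dir="${1:-$HOME/Library/RCLI/models}"
  if [ ! -d "$dir" ]; then
    echo "no models directory at $dir" >&2
    return 1
  fi
  du -sk "$dir"/* 2>/dev/null | sort -n
}

# Usage:
#   list_model_sizes
```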
Free up space by removing unused models:
Benchmarking
RCLI includes a built-in benchmark suite for measuring STT, LLM, TTS, and end-to-end latency.
Run Specific Benchmark Suites
Available suites: stt, llm, tts, e2e, tools, rag, memory, all.
Common Workflows
Workflow 1: Voice-Controlled Music
Workflow 2: Document Intelligence
Workflow 3: System Automation
Workflow 4: Model Experimentation
Next Steps
You’ve learned the basics of RCLI. Explore these topics next:
Commands Overview
Deep dive into all RCLI commands and options
macOS Actions
Master all 43 system actions
Model Guide
Learn about 20+ supported AI models
Architecture
Understand the voice pipeline internals
Troubleshooting
No audio output / TTS not working
- Check system volume (not muted)
- Verify TTS model is installed: rcli info
- Test TTS directly: rcli ask --no-speak "hello" (should show text output)
- Re-download TTS model: rcli voices
Voice input not detected
- Grant microphone permissions (System Settings → Privacy & Security → Microphone)
- Test microphone: rcli mic-test (should show audio levels)
- Speak louder or closer to the microphone
- Check VAD model: rcli info (Silero VAD should be listed)
Actions fail to execute
- Check action status: rcli actions
- Test action directly: rcli action open_app '{"app_name": "Safari"}'
- Enable the action if disabled: In the TUI, press A → navigate to action → toggle on
- Check macOS permissions (some actions require Accessibility access)
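When testing an action directly from a script, app names containing spaces or quotes need careful quoting. The sketch below builds the JSON argument for the open_app action. The helper name open_app_payload is hypothetical; only the open_app action and its app_name key come from this guide.

```shell
# Build the JSON argument for the open_app action, escaping backslashes
# and double quotes so the app name stays valid JSON.
# Control characters (tabs, newlines) are not handled in this sketch.
open_app_payload() {
  escaped=$(printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g')
  printf '{"app_name": "%s"}\n' "$escaped"
}

open_app_payload "Visual Studio Code"

# Usage (requires RCLI to be installed):
#   rcli action open_app "$(open_app_payload "Safari")"
```

Wrapping the command substitution in double quotes keeps the whole payload as a single argument, matching the single-quoted form shown in the list above.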
Slow response times
- Check hardware: rcli info (ensure Metal GPU is detected)
- Switch to a smaller/faster LLM: rcli upgrade-llm → Qwen3 0.6B (~250 tok/s)
- Close other GPU-intensive apps
- Benchmark your system: rcli bench
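Several of the checks in this section come down to reading rcli info. The sketch below scans that output for the two components mentioned above. The exact format of rcli info is not documented here, so matching on the words "Metal" and "Silero" is an assumption, and the helper name check_rcli_info is hypothetical.

```shell
# Read `rcli info` output on stdin and report whether the expected
# components are mentioned. The search terms are assumptions about
# what the output contains, not a documented format.
check_rcli_info() {
  info=$(cat)
  rc=0
  for needle in Metal Silero; do
    if printf '%s\n' "$info" | grep -qi "$needle"; then
      echo "ok: $needle"
    else
      echo "missing: $needle"
      rc=1
    fi
  done
  return "$rc"
}

# Usage (requires RCLI to be installed):
#   rcli info | check_rcli_info
```

A "missing" line is only a hint to read the full rcli info output yourself, since the component may be reported under a different name.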
Need Help?
View the full troubleshooting guide for advanced issues