Overview
While the Voice Agent provides conversational AI interaction, Voice Commands offer instant execution of predefined actions through simple voice triggers.Global Voice Shortcuts
Primary Voice Controls
Voice Agent
Ctrl+Alt+JLaunch the full Voice Agent for conversational AI interaction
Voice Transcription
Ctrl+Alt+TToggle voice-to-text transcription mode
Cycle Transcribe Modes
Ctrl+Shift+TSwitch between paste, typewriter, and buffer modes
Stop Auto-typing
Ctrl+Shift+XImmediately halt any ongoing AI typing
Built-in Voice Actions
These actions are available when the Voice Agent is active:System Commands
Change Theme
Change Theme
Voice trigger: “Change the theme to [light/dark]”Switches the Tabby interface between light and dark mode.
End Conversation
End Conversation
Voice trigger: “End conversation” / “Hang up” / “Goodbye”Gracefully ends the voice session and closes the agent.
Desktop Automation Commands
When Windows MCP is enabled, these voice commands are available:- Application Control
- File & Folder
- Web & URLs
- Clipboard
“Open Chrome” / “Launch VS Code” / “Start Notepad”Uses Powershell-Tool to launch applications quickly.
“Switch to Chrome” / “Focus on VS Code”Uses App-Tool to activate running applications.
“Close Chrome” / “Quit Notepad”Uses process management to terminate applications.
Workflow Automation
Create custom voice-triggered workflows by training the Voice Agent:Describe Your Workflow
Tell the agent about your routine:“Every morning I open Slack, Gmail in Chrome, and VS Code with my project folder.”
Example Workflows
Memory-Based Commands
The Voice Agent remembers context across sessions:Personal Context
Personal Context
Example: After telling the agent your name is Sarah:User: “What’s my name?”Agent: “Your name is Sarah.”The agent searches memories first before responding to any query.
Preferences
Preferences
Example: After expressing preference:User: “I prefer dark mode”[Agent stores preference]Later: “Set my theme”[Agent applies dark mode based on stored preference]
Technical Stack
Technical Stack
Example: Sharing your tech stack:User: “I work with React, TypeScript, and Tailwind CSS”[Agent stores technical preferences]Later: “Create a new component”[Agent generates React + TypeScript + Tailwind component]
Creating Custom Commands
You can extend voice commands by adding custom tools to the MCP server or creating new functions in the Voice Agent API.Custom Tool Template
DEFAULT_VOICE_TOOLS array in /lib/ai/voice/index.ts.
Tool Execution Handler
Implement the execution logic in the Voice Agent hook:Command Best Practices
Keyboard Shortcut Reference
Voice Agent
Ctrl+Alt+J
Transcription
Ctrl+Alt+T
Cycle Modes
Ctrl+Shift+T
Stop Typing
Ctrl+Shift+X
Actions Menu
Ctrl+\
Brain Panel
Ctrl+Shift+B
Troubleshooting
Voice not recognized
Voice not recognized
- Check microphone permissions in browser settings
- Verify microphone is selected as default input device
- Test microphone in browser console:
navigator.mediaDevices.getUserMedia({ audio: true }) - Reduce background noise
Commands not executing
Commands not executing
- Ensure Windows MCP server is running (for desktop automation)
- Check API keys are properly configured
- Verify memory backend is accessible at localhost:8000
- Review browser console for error messages
Agent doesn't remember
Agent doesn't remember
- Confirm memory backend is running
- Check Supabase connection for vector storage
- Verify Neo4j connection (if using knowledge graph)
- Try explicitly saying “Remember that” after sharing information
Requirements
Related Features
Voice Agent
Full conversational AI voice assistant
Voice Transcription
Speech-to-text for typing in any app
Action Menu
Quick AI actions triggered by keyboard
Brain Panel
Memory dashboard and management