What is Hybrid Mode
Hybrid mode allows SlasshyWispr to intelligently route voice processing tasks between online (cloud-based) and local (on-device) models. This gives you the flexibility to balance between performance, privacy, and cost based on your needs. With hybrid mode, you can configure:- STT (Speech-to-Text) to use online or local models
- AI (Assistant) to use online or local models
- Online STT + Local AI
- Local STT + Online AI
- All online or all local
How Hybrid Routing Works
SlasshyWispr provides three runtime modes for each component:Online Mode
Routes requests to cloud-based API providers. Requires API credentials and internet connection. Typically offers the best quality and fastest processing for complex tasks.
Local Mode
Routes requests to models running on your device. For STT, this uses downloaded Parakeet or Whisper models. For AI, this uses locally running Ollama models. Works completely offline.
Component-Level Configuration
The app tracks separate runtime modes:sttRuntimeMode: Controls speech-to-text processing (online|local)aiRuntimeMode: Controls AI assistant responses (online|local)runtimeMode: Legacy setting for overall mode preference
When to Use Hybrid Mode
Privacy-First Workflows
Privacy-First Workflows
Use local STT + local AI when handling sensitive information. All processing stays on your device with no data sent to external servers.Example: Medical dictation, legal notes, financial planning
Performance-Optimized Setup
Performance-Optimized Setup
Use online STT + local AI for fast transcription with privacy-conscious responses.Example: General productivity work where transcription speed matters but you want to keep AI reasoning local
Quality-First Configuration
Quality-First Configuration
Use online STT + online AI when you need the highest quality results and have reliable internet.Example: Professional content creation, complex research queries
Offline-Ready Setup
Offline-Ready Setup
Use local STT + local AI when working without internet access or with unreliable connectivity.Example: Travel, remote locations, air-gapped environments
Configuration for Hybrid
Configure STT Runtime
Choose your STT runtime mode:
- Select Online to use cloud-based speech recognition
- Select Offline to use local Parakeet or Whisper models
- Parakeet v3 (478 MB) - Recommended
- Parakeet v2 (473 MB)
- Whisper models (487 MB - 1.6 GB)
- Moonshine Base (58.0 MB)
- SenseVoice (160 MB)
Configure AI Runtime
Choose your AI runtime mode:
- Select Online to use cloud-based language models
- Select Offline to use local Ollama models
Set Up Credentials (if using Online)
If either component uses online mode:
- Enter your API Base URL
- Add your API Key
- Specify model names for STT and/or AI
Best Practices
Start with Online Mode: If you’re new to SlasshyWispr, start with online mode for both components to get the best initial experience, then experiment with local models once you’re comfortable.
Optimization Tips
- Model Selection: Choose smaller local models (like Moonshine Base) for speed, larger models (like Whisper Large) for accuracy
- Warmup Models: Local STT models load faster after first use. Consider warming up your model before important sessions
- API Costs: Monitor your online API usage and switch to local models for routine tasks to reduce costs
- Latency Monitoring: Check Settings > Pipeline to view real-time STT, AI, and TTS latencies to tune your configuration