Overview
The /use-local-whisper skill switches voice transcription from OpenAI's Whisper API to whisper.cpp running locally on your device. All transcription happens on-device: no API key, no network calls, no cost.
Requires the /add-voice-transcription skill to be applied first. Currently supports WhatsApp only.
Advantages
Zero Cost
No OpenAI API usage fees
Privacy
Audio never leaves your device
Offline
Works without internet connection
Fast
Apple Silicon optimization
Prerequisites
How to Apply
- Verify dependencies are installed
- Check for model file in data/models/
- Modify src/transcription.ts to use local whisper
- Update error handling for local execution
- Rebuild and restart
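The first two steps amount to a preflight check. A minimal sketch of that check follows, assuming hypothetical helper names (`preflight`, `Probe`) that are not part of the skill; the probes are injected so the logic can be exercised without whisper-cli or a model file being present.

```typescript
// Hypothetical helper, not the skill's actual code.
// A probe answers "does this command or file exist?" and is supplied by the caller.
type Probe = (target: string) => boolean;

interface PreflightReport {
  ok: boolean;
  problems: string[];
}

function preflight(
  modelPath: string,
  commandExists: Probe,
  fileExists: Probe,
): PreflightReport {
  const problems: string[] = [];
  // Step 1: the whisper-cli binary must be reachable.
  if (!commandExists("whisper-cli")) {
    problems.push("whisper-cli not found");
  }
  // Step 2: the GGML model file must be present.
  if (!fileExists(modelPath)) {
    problems.push(`Model file not found: ${modelPath}`);
  }
  return { ok: problems.length === 0, problems };
}
```

In a real setup the probes could wrap a PATH lookup and a file-system check; both are left to the caller here.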
What Changes
Files Modified
- src/transcription.ts - Switches from OpenAI API to whisper-cli
- src/channels/whatsapp.ts - Audio handling updated
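As a rough illustration of what calling whisper-cli from src/transcription.ts might look like, here is a sketch under stated assumptions: the function and type names are hypothetical, the input is assumed to already be a 16 kHz WAV file, and the command runner is injected rather than hard-wired. The flags used (`-m`, `-f`, `-l`, `-nt`, `-np`) are whisper-cli options for the model, input file, language, no timestamps, and no extra log output.

```typescript
// Hypothetical names; the skill's real implementation may differ.
interface LocalWhisperOptions {
  modelPath: string; // e.g. "data/models/ggml-base.bin"
  language: string; // e.g. "en"
}

// Runs a command and resolves with its stdout. In Node this could wrap
// child_process.execFile; it is injected to keep the sketch testable.
type RunCommand = (command: string, args: string[]) => Promise<string>;

function buildWhisperArgs(wavPath: string, opts: LocalWhisperOptions): string[] {
  return ["-m", opts.modelPath, "-f", wavPath, "-l", opts.language, "-nt", "-np"];
}

// Collapse whisper-cli's line-oriented output into a single line of text.
function cleanTranscript(stdout: string): string {
  return stdout
    .split("\n")
    .map((line) => line.trim())
    .filter((line) => line.length > 0)
    .join(" ");
}

async function transcribeLocally(
  wavPath: string,
  opts: LocalWhisperOptions,
  run: RunCommand,
): Promise<string> {
  const stdout = await run("whisper-cli", buildWhisperArgs(wavPath, opts));
  return cleanTranscript(stdout);
}
```

Injecting the runner keeps the argument building and output cleanup separate from process spawning, which is also where a missing binary or model file would surface as an error.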
Behavioral Changes
- Voice messages transcribed locally
- No network calls for transcription
- Slightly different transcription quality (depends on model)
Model Selection
| Model | Size | Speed | Quality | Use Case |
|---|---|---|---|---|
| tiny | ~75MB | Fastest | Lower | Quick transcription |
| base | ~150MB | Fast | Good | General use |
| small | ~500MB | Medium | Better | Accuracy priority |
| medium | ~1.5GB | Slow | Best | Maximum quality |
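The table can also be expressed as typed data, which is handy when validating a configured model name. This is a sketch only: the identifiers are hypothetical, and the `ggml-<model>.bin` naming for models other than base is an assumption extrapolated from the documented default path.

```typescript
// Hypothetical names; values are copied from the table above.
type WhisperModel = "tiny" | "base" | "small" | "medium";

interface ModelInfo {
  approxSize: string;
  speed: string;
  quality: string;
}

const MODELS: Record<WhisperModel, ModelInfo> = {
  tiny: { approxSize: "~75MB", speed: "Fastest", quality: "Lower" },
  base: { approxSize: "~150MB", speed: "Fast", quality: "Good" },
  small: { approxSize: "~500MB", speed: "Medium", quality: "Better" },
  medium: { approxSize: "~1.5GB", speed: "Slow", quality: "Best" },
};

// Type guard for a user-supplied model name.
function isWhisperModel(name: string): name is WhisperModel {
  return Object.prototype.hasOwnProperty.call(MODELS, name);
}

// Assumption: every model file follows the ggml-<model>.bin pattern.
function defaultModelPath(model: WhisperModel, dir: string = "data/models"): string {
  return `${dir}/ggml-${model}.bin`;
}
```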
Start with the base model for the best balance of speed and quality.
Configuration
Environment Variables
- Path to GGML model file (default: data/models/ggml-base.bin)
- Language code for better accuracy (default: en)
Usage
No changes to user experience.
Performance
Apple Silicon (M1/M2/M3)
- base model: ~2-3x realtime (10s audio = 3-5s to transcribe)
- small model: ~1-1.5x realtime (10s audio = 7-10s to transcribe)
Intel Mac
- base model: ~1-1.5x realtime
- small model: ~0.5-1x realtime (may be slower than audio)
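The "x realtime" figures convert to wall-clock time by dividing the audio length by the factor. A small sketch of that arithmetic, using hypothetical function names:

```typescript
// Hypothetical helpers illustrating the realtime-factor arithmetic.
function estimateTranscriptionSeconds(audioSeconds: number, realtimeFactor: number): number {
  if (realtimeFactor <= 0) {
    throw new RangeError("realtimeFactor must be positive");
  }
  return audioSeconds / realtimeFactor;
}

// A factor below 1 means transcription takes longer than the audio itself.
function isSlowerThanAudio(realtimeFactor: number): boolean {
  return realtimeFactor < 1;
}

// 10 s of audio at 2x realtime takes 5 s; at 3x it takes about 3.3 s.
const slowest = estimateTranscriptionSeconds(10, 2);
const fastest = estimateTranscriptionSeconds(10, 3);
```

This matches the base-model range on Apple Silicon quoted above (3-5s for 10s of audio), and shows why the small model at 0.5x on an Intel Mac would need about 20s for the same clip.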
Troubleshooting
whisper-cli not found
Install whisper-cpp:
Model file not found
Check model path:
Download if missing (see Prerequisites above).
Slow transcription
Try a smaller model:
Or upgrade to Apple Silicon if on an Intel Mac.
Poor transcription quality
Use a larger model:
Switching Back to OpenAI
To revert to OpenAI Whisper API:
- Restore from git:
- Rebuild:
- Restart service
Related Documentation
Voice Transcription
Base voice transcription skill
WhatsApp Channel
WhatsApp setup and features