TTS Engine Selection
Text-to-speech engine
piper- Fast, lightweight, zero-dependency TTS (default)coqui- High-quality neural TTS with voice cloning (requires Python setup)
Coqui is disabled in zero-Python mode. If
ZERO_PYTHON_MODE = true, only Piper is available.Piper Configuration
Piper is the default TTS engine: lightweight, fast, and requires no external dependencies.Path to Piper executableSlasshyWispr automatically installs and configures Piper on first run. This setting is managed internally.
Piper speech rate multiplier
- Range:
0.5to2.0 1.0= normal speed1.08= slightly faster (default)1.5= 50% faster0.8= 20% slower
Piper voice quality preset
fast- Fastest inference, good quality (default)balanced- Balanced speed and qualityhigh- Best quality, slower inference
Piper emotional tone
neutral- Standard neutral voice (default)calm- Calming, relaxed tonehappy- Upbeat, cheerful toneexcited- Energetic, enthusiastic toneserious- Professional, formal tonesad- Somber, low-energy tone
Emotion support depends on the installed voice model. Not all emotions may be available.
Coqui Configuration
Coqui TTS provides studio-quality neural voices with voice cloning capabilities.Coqui requires Python and additional setup. It is disabled when
ZERO_PYTHON_MODE = true.Path to Python executable for CoquiMust point to a Python installation with TTS library installed:
Coqui TTS model identifierDefault model is XTTS v2, a multilingual neural TTS model.Other options:
tts_models/en/ljspeech/tacotron2-DDCtts_models/en/vctk/vits- See Coqui model list:
tts --list_models
Language code for Coqui TTSSupported languages depend on the model. XTTS v2 supports:
en- English (default)es- Spanishfr- Frenchde- Germanit- Italianpt- Portuguesezh- Chinese- And more…
Voice speaker ID or cloned voice file
- For built-in voices: speaker ID (e.g.,
p225,p226) - For cloned voices: path to reference audio file
Use the voice cloning feature to create custom voices from audio samples.
Coqui speech rate multiplier
- Range:
0.5to2.0 1.0= normal speed (default)- Values work the same as Piper speed
Coqui voice quality preset
fast- Faster inference, good qualitybalanced- Balanced speed and quality (default)high- Best quality, slower inference
Coqui emotional toneSame options as Piper:
neutral, calm, happy, excited, serious, sadEnable GPU acceleration for CoquiRequires CUDA-compatible GPU and PyTorch with CUDA support.Benefits:
- 5-10x faster inference
- Lower CPU usage
- Enables real-time TTS for long responses
Split text into sentences before synthesisWhen enabled, long responses are broken into sentences and synthesized separately for more natural pacing.
Voice Installation
- Piper
- Coqui
Piper voices are installed automatically:
- Open Settings > TTS
- Select Piper as TTS engine
- Click “Install Voice” (if not already installed)
- SlasshyWispr downloads and configures the default voice
- Test with “Preview Voice” button
Voice models are stored locally. No internet connection needed after installation.
Voice Cloning (Coqui Only)
Create custom voices from reference audio samples.Requirements
- Coqui TTS engine enabled
- XTTS v2 or compatible cloning model
- Clean audio sample (5-30 seconds recommended)
- Single speaker, minimal background noise
Cloning Process
- Open Settings > TTS > Coqui
- Click “Clone Voice”
- Upload or record reference audio (max 30 seconds)
- Provide a speaker ID name
- SlasshyWispr processes the audio and creates a voice profile
- Test with “Preview Voice” button
- Select the cloned voice from the voice dropdown
Maximum reference audio length: seconds (from
constants.ts:87)Best Practices for Voice Cloning
- Audio quality: Use high-quality recordings (44.1kHz or higher)
- Duration: 10-20 seconds is optimal
- Content: Natural speech, varied intonation
- Environment: Quiet room, no echo or reverb
- Speaker: Single speaker only, consistent volume
- Emotion: Neutral tone for most versatile results
Assistant Name
Display name for the assistant in conversationsThis appears in the home history and TTS announcements.
Troubleshooting
No audio playback
No audio playback
- Check system volume and output device
- Verify TTS engine is properly installed
- Test with “Preview Voice” in settings
- Check SlasshyWispr audio permissions
- Try switching to the other TTS engine
Coqui installation fails
Coqui installation fails
- Verify Python version (3.8 or higher required)
- Install TTS library:
pip install TTS - Check Python path in settings
- Look for error messages in SlasshyWispr logs
- Try installing in a virtual environment
Voice sounds robotic or garbled
Voice sounds robotic or garbled
- Increase quality setting (Piper/Coqui quality)
- Reduce speed multiplier
- For Coqui: enable GPU acceleration if available
- For cloning: use higher quality reference audio
- Try a different voice model
TTS is too slow
TTS is too slow
- Lower quality setting to “fast”
- For Coqui: enable GPU acceleration
- Increase speed multiplier
- Switch to Piper for fastest inference
- Disable sentence splitting (Coqui)