SlasshyWispr includes built-in text-to-speech (TTS) for reading assistant responses aloud. Choose between Piper (fast, zero-Python) and Coqui (high-quality, requires Python).

TTS Engine Selection

ttsEngine
TtsEngine
default:"piper"
Text-to-speech engine
  • piper - Fast, lightweight, zero-dependency TTS (default)
  • coqui - High-quality neural TTS with voice cloning (requires Python setup)
Coqui is disabled in zero-Python mode. If ZERO_PYTHON_MODE = true, only Piper is available.
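As a sketch, assuming settings are persisted as JSON keys named after the fields documented here (the actual storage format isn't shown on this page), switching engines would look like:

```json
{
  "ttsEngine": "coqui"
}
```

Set the value back to "piper" (the default) to return to the zero-Python engine.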

Piper Configuration

Piper is the default TTS engine: lightweight, fast, and requires no external dependencies.
piperPath
string
default:""
Path to the Piper executable.
SlasshyWispr automatically installs and configures Piper on first run. This setting is managed internally.
piperSpeed
number
default:"1.08"
Piper speech rate multiplier
  • Range: 0.5 to 2.0
  • 1.0 = normal speed
  • 1.08 = slightly faster (default)
  • 1.5 = 50% faster
  • 0.8 = 20% slower
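The multiplier maps to playback time as simple division; this is plain arithmetic, not SlasshyWispr code, and the function name is illustrative:

```python
def playback_seconds(base_seconds: float, speed: float) -> float:
    """Duration of a clip synthesized at the given rate multiplier.

    A clip that takes base_seconds at 1.0x finishes in base_seconds / speed.
    """
    if not 0.5 <= speed <= 2.0:
        raise ValueError("speed must be within 0.5-2.0")
    return base_seconds / speed
```

So a 10-second utterance at the 1.08 default plays in roughly 9.26 seconds, and at 1.5 in roughly 6.67 seconds.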
piperQuality
PiperQuality
default:"fast"
Piper voice quality preset
  • fast - Fastest inference, good quality (default)
  • balanced - Balanced speed and quality
  • high - Best quality, slower inference
piperEmotion
PiperEmotion
default:"neutral"
Piper emotional tone
  • neutral - Standard neutral voice (default)
  • calm - Calming, relaxed tone
  • happy - Upbeat, cheerful tone
  • excited - Energetic, enthusiastic tone
  • serious - Professional, formal tone
  • sad - Somber, low-energy tone
Emotion support depends on the installed voice model. Not all emotions may be available.
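Putting the Piper fields together, and again assuming a JSON settings file keyed by the field names above (an assumption, not shown in this doc), a customized Piper setup might look like:

```json
{
  "ttsEngine": "piper",
  "piperSpeed": 1.08,
  "piperQuality": "fast",
  "piperEmotion": "calm"
}
```

piperPath is omitted deliberately: it is managed internally and should normally be left at its default.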

Coqui Configuration

Coqui TTS provides studio-quality neural voices with voice cloning capabilities.
Coqui requires Python and additional setup. It is disabled when ZERO_PYTHON_MODE = true.
coquiPythonPath
string
default:""
Path to the Python executable used for Coqui.
Must point to a Python installation with the TTS library installed:
pip install TTS
coquiModelName
string
default:"tts_models/multilingual/multi-dataset/xtts_v2"
Coqui TTS model identifier.
The default model is XTTS v2, a multilingual neural TTS model. Other options:
  • tts_models/en/ljspeech/tacotron2-DDC
  • tts_models/en/vctk/vits
  • See Coqui model list: tts --list_models
coquiLanguage
string
default:"en"
Language code for Coqui TTS.
Supported languages depend on the model. XTTS v2 supports:
  • en - English (default)
  • es - Spanish
  • fr - French
  • de - German
  • it - Italian
  • pt - Portuguese
  • zh - Chinese
  • And more…
coquiVoiceId
string
default:""
Voice speaker ID or cloned voice file
  • For built-in voices: speaker ID (e.g., p225, p226)
  • For cloned voices: path to reference audio file
Use the voice cloning feature to create custom voices from audio samples.
coquiSpeed
number
default:"1.0"
Coqui speech rate multiplier
  • Range: 0.5 to 2.0
  • 1.0 = normal speed (default)
  • Values work the same as Piper speed
coquiQuality
CoquiQuality
default:"balanced"
Coqui voice quality preset
  • fast - Faster inference, good quality
  • balanced - Balanced speed and quality (default)
  • high - Best quality, slower inference
coquiEmotion
CoquiEmotion
default:"neutral"
Coqui emotional tone.
Same options as Piper: neutral, calm, happy, excited, serious, sad.
coquiUseGpu
boolean
default:"false"
Enable GPU acceleration for Coqui.
Requires a CUDA-compatible GPU and PyTorch with CUDA support. Benefits:
  • 5-10x faster inference
  • Lower CPU usage
  • Enables real-time TTS for long responses
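Since Coqui runs on PyTorch, a quick way to check whether GPU acceleration is usable is to ask PyTorch itself. This is a generic sketch, not SlasshyWispr's own detection logic, and it degrades gracefully when torch isn't installed:

```python
def gpu_available() -> bool:
    """True if a CUDA-capable device is visible to PyTorch."""
    try:
        import torch  # optional: absent on Piper-only / zero-Python setups
    except ImportError:
        return False
    return torch.cuda.is_available()
```

If this returns False despite having an NVIDIA GPU, the installed PyTorch build likely lacks CUDA support.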
coquiSplitSentences
boolean
default:"false"
Split text into sentences before synthesis.
When enabled, long responses are broken into sentences and synthesized separately for more natural pacing.
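Conceptually, sentence splitting works like the sketch below: break on sentence-ending punctuation, then synthesize each piece separately. SlasshyWispr's actual splitter may differ; this only illustrates the idea:

```python
import re

def split_sentences(text: str) -> list[str]:
    """Split on sentence-ending punctuation followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]
```

For example, "Hello there. How are you? Fine!" yields three chunks, each short enough to synthesize with natural pacing.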

Voice Installation

Piper voices are installed automatically:
  1. Open Settings > TTS
  2. Select Piper as TTS engine
  3. Click “Install Voice” (if not already installed)
  4. SlasshyWispr downloads and configures the default voice
  5. Test with “Preview Voice” button
Voice models are stored locally. No internet connection needed after installation.

Voice Cloning (Coqui Only)

Create custom voices from reference audio samples.

Requirements

  • Coqui TTS engine enabled
  • XTTS v2 or compatible cloning model
  • Clean audio sample (5-30 seconds recommended)
  • Single speaker, minimal background noise

Cloning Process

  1. Open Settings > TTS > Coqui
  2. Click “Clone Voice”
  3. Upload or record reference audio (max 30 seconds)
  4. Provide a speaker ID name
  5. SlasshyWispr processes the audio and creates a voice profile
  6. Test with “Preview Voice” button
  7. Select the cloned voice from the voice dropdown
Maximum reference audio length: 30 seconds (from constants.ts:87)
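After cloning, the resulting voice is selected through the Coqui fields documented above. The JSON shape and the reference-audio path below are placeholders, assuming settings are stored under the documented field names:

```json
{
  "ttsEngine": "coqui",
  "coquiModelName": "tts_models/multilingual/multi-dataset/xtts_v2",
  "coquiLanguage": "en",
  "coquiVoiceId": "/path/to/reference.wav"
}
```

For built-in voices, coquiVoiceId would instead hold a speaker ID such as p225.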

Best Practices for Voice Cloning

  • Audio quality: Use high-quality recordings (44.1kHz or higher)
  • Duration: 10-20 seconds is optimal
  • Content: Natural speech, varied intonation
  • Environment: Quiet room, no echo or reverb
  • Speaker: Single speaker only, consistent volume
  • Emotion: Neutral tone for most versatile results
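The measurable guidelines above (5-30 s duration, 44.1 kHz or higher) can be pre-checked before uploading. This helper is illustrative, not part of SlasshyWispr, and only handles uncompressed WAV files via Python's standard wave module:

```python
import wave

def check_reference_audio(path: str) -> list[str]:
    """Return a list of guideline violations for a reference WAV clip."""
    issues = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        seconds = wav.getnframes() / rate
    if rate < 44_100:
        issues.append(f"sample rate {rate} Hz is below 44.1 kHz")
    if not 5 <= seconds <= 30:
        issues.append(f"duration {seconds:.1f} s is outside the 5-30 s range")
    return issues
```

An empty list means the clip meets the two quantitative guidelines; qualitative ones (single speaker, quiet room, neutral tone) still need a human ear.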

Assistant Name

assistantName
string
default:"Lily"
Display name for the assistant in conversations.
This appears in the home history and TTS announcements.

Troubleshooting

No audio output

  1. Check system volume and output device
  2. Verify the TTS engine is properly installed
  3. Test with “Preview Voice” in settings
  4. Check SlasshyWispr audio permissions
  5. Try switching to the other TTS engine

Coqui TTS not working

  1. Verify the Python version (3.8 or higher required)
  2. Install the TTS library: pip install TTS
  3. Check the Python path in settings
  4. Look for error messages in the SlasshyWispr logs
  5. Try installing in a virtual environment

Poor voice quality

  1. Increase the quality setting (Piper/Coqui quality)
  2. Reduce the speed multiplier
  3. For Coqui: enable GPU acceleration if available
  4. For cloning: use higher-quality reference audio
  5. Try a different voice model

Slow synthesis

  1. Lower the quality setting to “fast”
  2. For Coqui: enable GPU acceleration
  3. Increase the speed multiplier
  4. Switch to Piper for the fastest inference
  5. Disable sentence splitting (Coqui)
