Voice-Activated Mode
In voice-activated mode, Klaus continuously listens and automatically detects when you start and stop speaking.How It Works
Klaus is always listening
When the status bar shows ● Ready (green), Klaus is actively monitoring for speech.
You start speaking
Klaus detects your voice using WebRTC Voice Activity Detection (VAD) and begins recording. The status changes to ● Listening (blue).
Advantages
- Hands-free operation - no need to hold a key
- Natural flow - just speak when you have a question
- Fast study loops - minimal friction between questions
Quality Filtering
Voice-activated mode includes multiple layers of quality filtering to reject background noise and false triggers:- Voiced ratio - at least 28% of the audio must contain speech (not just noise)
- Minimum voiced frames - at least 8 frames of 30ms must contain speech
- Minimum duration - utterances shorter than 0.5 seconds are discarded
- RMS loudness - audio must be louder than -45 dBFS (not just a whisper or hum)
- Contiguous voiced run - at least 6 consecutive voiced frames (filters out clicks and pops)
~/.klaus/config.toml if you’re experiencing false triggers or missed questions.
When to Use Voice-Activated Mode
- You’re in a quiet environment with minimal background noise
- You want hands-free operation
- You’re reading from a physical book and need both hands free
- You prefer a conversational, natural flow
Push-to-Talk Mode
In push-to-talk mode, you hold a hotkey to record your question and release it to send.How It Works
Hold the hotkey
Press and hold your push-to-talk key (default:
§ on macOS, F2 on Windows). The status changes to ● Listening (blue).Advantages
- Precise control - you decide exactly when to record
- Works in noisy environments - no risk of false triggers
- No silence timeout - you can pause mid-question without it being finalized
- Instant send - no waiting for silence detection
When to Use Push-to-Talk Mode
- You’re in a noisy environment (coffee shop, library, shared office)
- You want precise control over when Klaus listens
- You’re asking complex questions and need time to think mid-sentence
- You’re testing Klaus or demonstrating it to others
Switching Between Modes
You can toggle between voice-activated and push-to-talk at any time:Using the Status Bar
Click the mode button in the status bar:- PTT - currently in push-to-talk mode, click to switch to voice-activated
- Voice - currently in voice-activated mode, click to switch to push-to-talk
Using the Toggle Hotkey
Press your toggle key (default:§ on macOS, F3 on Windows) to switch modes.
Configuring Voice-Activated Settings
All VAD settings live in~/.klaus/config.toml. Edit the file and restart Klaus to apply changes.
Sensitivity (0-3)
3 (most aggressive).
- 0 - least aggressive, may pick up more background noise
- 3 - most aggressive, stricter about what counts as speech
Silence Timeout (seconds)
1.5 seconds.
- Shorter (e.g.,
1.0) - faster response, but may cut off if you pause mid-sentence - Longer (e.g.,
2.0) - more forgiving of pauses, but adds latency
Minimum Duration (seconds)
0.5 seconds.
Minimum Voiced Ratio (0.0-1.0)
0.28 (28%).
Minimum Voiced Frames
8 frames (240ms of speech).
Minimum RMS Loudness (dBFS)
-45.0.
- Higher (e.g.,
-40.0) - stricter, rejects quieter audio - Lower (e.g.,
-50.0) - more permissive, accepts quieter speech
Minimum Voiced Run Frames
6 frames (180ms). Helps reject clicks and pops.
Configuring Push-to-Talk Hotkey
Edit~/.klaus/config.toml and restart Klaus:
- Function keys:
F1,F2,F3, …,F12 - Single characters:
a,b,§,~, etc. - Special keys:
space,tab,esc
On macOS, the section key
§ (to the left of 1) is the default. On Windows, F2 is the default.Configuring Toggle Hotkey
Edit~/.klaus/config.toml and restart Klaus:
Troubleshooting
Voice-Activated Mode Not Detecting Speech
- Check the status bar - if it shows ”● Idle” instead of ”● Ready”, you’re in push-to-talk mode. Click the mode button to switch.
- Lower the sensitivity - try
vad_sensitivity = 2inconfig.toml - Lower the RMS threshold - try
vad_min_rms_dbfs = -50.0 - Check your microphone - open Settings and verify the correct mic is selected
Voice-Activated Mode Picking Up Background Noise
- Increase the sensitivity - try
vad_sensitivity = 3(if not already) - Increase the voiced ratio - try
vad_min_voiced_ratio = 0.35 - Increase the RMS threshold - try
vad_min_rms_dbfs = -40.0 - Switch to push-to-talk mode for noisy environments
Voice-Activated Mode Cutting Off Mid-Sentence
- Increase the silence timeout - try
vad_silence_timeout = 2.0 - Speak more continuously - avoid long pauses in the middle of your question
- Use push-to-talk mode for complex, multi-part questions
Push-to-Talk Hotkey Not Working
- Check if the Klaus window is focused - in-app hotkeys only work when Klaus is in the foreground
- On macOS, grant Accessibility permission for global hotkeys to work when Klaus is in the background
- Try a different key - some keys may conflict with system shortcuts (e.g., F3 opens Mission Control on macOS)
- Check the logs - Klaus logs hotkey listener startup and failures at
INFOlevel