Voice to text for Linux
Push-to-talk voice transcription with offline processing, GPU acceleration, and native Wayland integration. Transform speech into text instantly without sending data to the cloud.
Quick Start
Get Voxtype running on your Linux system in minutes
Install Voxtype
.deb or .rpm packages for your distribution.Install dependencies
wtype for best Unicode/CJK support on Wayland.Download transcription model
base.en model provides a good balance of speed and accuracy.Configure compositor keybinding
Hyprland configuration
Hyprland configuration
~/.config/hypr/hyprland.conf:Sway configuration
Sway configuration
~/.config/sway/config:River configuration
River configuration
~/.config/river/init:Key Features
Everything you need for voice-to-text on Linux
Offline transcription
GPU acceleration
Wayland native
Meeting mode
Multiple output modes
Text processing
Explore by Topic
Dive deeper into Voxtype’s capabilities
Basic Usage
Learn push-to-talk controls, toggle mode, and how to configure your preferred hotkey.
Configuration
Customize models, audio settings, output behavior, and post-processing with TOML configuration.
Transcription Engines
Choose from 7 engines: Whisper, Parakeet, Moonshine, SenseVoice, Paraformer, Dolphin, and Omnilingual.
Architecture
Understand Voxtype’s design: hotkey detection, audio capture, transcription backends, and output drivers.
Ready to get started?
Install Voxtype on your Linux system and start transcribing speech with push-to-talk voice recognition.
Get Started