Welcome to Resonance
The open-source alternative to ElevenLabs. Generate natural-sounding speech and clone voices instantly with AI-powered text-to-speech built on Next.js 16 and Chatterbox TTS.
Text-to-Speech
Natural AI voices
Voice Cloning
10s sample, instant clone
20 Built-in Voices
12 categories, 5 locales
Quick start
Get Resonance running locally in minutes
Configure environment variables
Copy the example environment file and add your credentials:
Required environment variables
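As a rough illustration of checking that the required variables are set before the app starts, here is a minimal TypeScript sketch. The variable names in `REQUIRED_ENV_VARS` and the helpers `findMissingEnv` and `assertEnv` are hypothetical and are not taken from the Resonance codebase; the authoritative list is whatever the project's example environment file contains.

```typescript
// Hypothetical variable names, for illustration only. The real list is
// defined by the project's example environment file.
const REQUIRED_ENV_VARS = [
  "DATABASE_URL",
  "CLERK_SECRET_KEY",
  "POLAR_ACCESS_TOKEN",
] as const;

type Env = Record<string, string | undefined>;

// Returns the names of required variables that are unset or blank.
function findMissingEnv(
  env: Env,
  required: readonly string[] = REQUIRED_ENV_VARS,
): string[] {
  return required.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}

// Throws one error listing every missing variable, so a misconfigured
// setup fails once with a complete message instead of piecemeal.
function assertEnv(
  env: Env,
  required: readonly string[] = REQUIRED_ENV_VARS,
): void {
  const missing = findMissingEnv(env, required);
  if (missing.length > 0) {
    throw new Error(
      `Missing required environment variables: ${missing.join(", ")}`,
    );
  }
}
```

In a Node.js or Next.js process, a call such as `assertEnv(process.env)` at startup would surface missing credentials early. Taking the environment as a parameter keeps the check easy to test.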
Set up the database and deploy the TTS engine
Run migrations and deploy the Chatterbox TTS model to Modal.
The Modal deployment runs on a serverless NVIDIA A10G GPU. The first request after a period of inactivity may be noticeably slower because of a cold start.
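One common way for a client to tolerate cold starts is to retry a failed or timed-out request with exponential backoff. The sketch below is an assumption about how a caller might do this, not a description of how Resonance handles it; `backoffDelayMs`, `withRetry`, and the default timings are hypothetical.

```typescript
// Delay before retry number `attempt` (0-based): doubles each time, capped.
function backoffDelayMs(attempt: number, baseMs: number, capMs: number): number {
  return Math.min(capMs, baseMs * 2 ** attempt);
}

function sleep(ms: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, ms));
}

// Runs `fn` up to `maxAttempts` times, waiting between failed attempts.
// Rethrows the last error if every attempt fails.
async function withRetry<T>(
  fn: () => Promise<T>,
  maxAttempts = 4,
  baseMs = 500,
  capMs = 8000,
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      // Only wait if another attempt will follow.
      if (attempt < maxAttempts - 1) {
        await sleep(backoffDelayMs(attempt, baseMs, capMs));
      }
    }
  }
  throw lastError;
}
```

A caller could wrap its own request function, for example `withRetry(() => fetch(url, options))`, where `url` and `options` stand in for whatever the TTS request actually looks like. Capping the delay keeps the worst-case wait bounded.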
Seed voices and start the app
Populate the database with 20 built-in voices and start the development server.
Open http://localhost:3000 to see Resonance in action.
Core features
Everything you need for production-ready text-to-speech
Text-to-Speech
Generate speech from text with adjustable creativity, variety, expression, and flow parameters
Voice Cloning
Upload or record a 10-second sample and clone any voice instantly with zero-shot learning
Voice Library
Browse 20 pre-seeded system voices and manage your custom cloned voices
Generation History
Access past generations with preserved voice metadata and waveform visualization
Multi-Tenant
Team-based access via Clerk Organizations with complete data isolation
Usage Billing
Pay-as-you-go character metering with configurable pricing via Polar
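To make the character-metering idea concrete, here is a minimal sketch of how a per-generation charge could be estimated from the input text. The function name `estimateCost`, the rate unit (cents per 1,000 characters), and the choice to count characters with `text.length` are all assumptions for illustration; the actual metering and pricing are configured through Polar and may differ.

```typescript
interface CostEstimate {
  characters: number;
  costCents: number;
}

// Estimates the charge for one generation.
// Assumes characters are counted as UTF-16 code units (`text.length`)
// and that any fractional cent is rounded up.
function estimateCost(text: string, centsPerThousandChars: number): CostEstimate {
  if (centsPerThousandChars < 0) {
    throw new RangeError("centsPerThousandChars must be non-negative");
  }
  const characters = text.length;
  const costCents = Math.ceil((characters * centsPerThousandChars) / 1000);
  return { characters, costCents };
}
```

With a hypothetical rate of 30 cents per 1,000 characters, a 2,000-character script would be estimated at 60 cents, and an empty string at 0.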
Explore by topic
Dive deeper into specific areas
Ready to build with Resonance?
Follow the quickstart guide to get up and running in under 10 minutes
Get Started Now