Quickstart Guide
This guide will walk you through creating your first text-to-speech audio with VozCraft. In just a few steps, you’ll be generating professional-quality audio in any of our 22+ supported languages.Estimated Time: 5 minutes to generate your first audio
Prerequisites
Before you begin, ensure you have:- A modern web browser (Chrome, Edge, Safari, or Opera recommended)
- An active internet connection (only for initial page load)
- JavaScript enabled in your browser
- Basic speakers or headphones to hear the generated audio
VozCraft runs entirely in your browser—no account creation or software installation required!
Step 1: Access VozCraft
Navigate to the VozCraft application in your web browser. The application loads instantly and presents you with the main interface:Open the Application
The VozCraft interface displays with a clean, intuitive layout featuring:
- Navigation bar with theme toggle and language selector
- Main control panel on the left
- History panel on the right (desktop) or below (mobile)
Choose Your Theme
Click the ”🌙 Oscuro” / ”☀ Claro” button in the top-right to toggle between dark and light modes. The interface adapts instantly to your preference.
- Dark Mode: Optimal for low-light environments
- Light Mode: Better for bright conditions
Step 2: Configure Voice Settings
The control panel offers four key settings that shape your audio output. Let’s configure each one:🌍 Voice / Accent / Region
Select your desired language and regional accent from the dropdown menu:- Spanish
- English
- Other Languages
Choose from 6 Spanish variants:
- Español (México) 🇲🇽 - Mexican Spanish
- Español (España) 🇪🇸 - Castilian Spanish
- Español (Argentina) 🇦🇷 - Argentine Spanish
- Español (Colombia) 🇨🇴 - Colombian Spanish
- Español (Chile) 🇨🇱 - Chilean Spanish
- Español (Venezuela) 🇻🇪 - Venezuelan Spanish
🎭 Voice Type
Choose between two voice characteristics:- 🔉 Normal Voice: Standard pitch (0.75) - Balanced, natural-sounding voice suitable for most content
- 🔊 High-pitched Voice: Elevated pitch (1.30) - Lighter, more energetic tone ideal for upbeat content
⚡ Speed
Control the playback speed with five options:| Speed | Rate | Best For |
|---|---|---|
| Muy Lento / Very Slow | 0.50x | Learning, careful listening |
| Lento / Slow | 0.75x | Comprehension, note-taking |
| Normal | 1.00x | Standard content, default setting |
| Rápido / Fast | 1.25x | Quick reviews, experienced listeners |
| Muy Rápido / Very Fast | 1.60x | Rapid consumption, time-saving |
The default Normal speed (1.00x) provides the most natural-sounding results for first-time users.
💫 Mood
Shape the emotional character of your audio with eight mood presets:😐 Neutral
Balanced expression
- Pitch: 1.00
- Rate: 1.00x
- Volume: 100%
- Best for: Professional content, news, documentation
😄 Happy / Alegre
High and lively tone
- Pitch: 1.25
- Rate: 1.15x
- Volume: 100%
- Best for: Marketing, children’s content, celebrations
😠 Serious / Serio
Deep, steady and firm
- Pitch: 0.80
- Rate: 0.88x
- Volume: 95%
- Best for: Formal announcements, serious topics
🤩 Enthusiastic / Entusiasta
Very energetic and expressive
- Pitch: 1.35
- Rate: 1.25x
- Volume: 100%
- Best for: Motivational content, sports, excitement
😔 Melancholic / Melancólico
Soft, slow and nostalgic
- Pitch: 0.70
- Rate: 0.78x
- Volume: 88%
- Best for: Poetry, reflective content, memorials
⚡ Energetic / Enérgico
Fast, dynamic and powerful
- Pitch: 1.15
- Rate: 1.30x
- Volume: 100%
- Best for: Action content, workouts, urgent messages
😌 Relaxed / Relajado
Calm and slow-paced
- Pitch: 0.88
- Rate: 0.82x
- Volume: 90%
- Best for: Meditation, bedtime stories, ASMR
😤 Tense / Tenso
Urgent and tense
- Pitch: 1.10
- Rate: 1.18x
- Volume: 95%
- Best for: Thrillers, dramatic narration, alerts
Mood Visualization
After selecting a mood, VozCraft displays a visual indicator showing the mood’s impact on three parameters:- Pitch/Tono: How high or low the voice sounds
- Rate/Velocidad: How fast the voice speaks
- Volume/Volumen: The audio output level
Step 3: Enter Your Text
In the large text area, type or paste the content you want to convert to speech:Input Your Content
Click inside the text area and enter your text. You can:
- Type directly into the field
- Paste content from your clipboard (Ctrl+V / Cmd+V)
- Edit and refine your text before generation
Example Text (Spanish)
Example Text (English)
Step 4: Generate Audio
Once your settings are configured and text is entered:Click Generate Audio
Press the large ”🎧 Generar Audio” / ”🎧 Generate Audio” button at the bottom of the control panel.The button:
- Has a red gradient background
- Animates on hover (lifts slightly)
- Changes to purple with ”⏳ Generando…” / ”⏳ Generating…” during processing
Wait for Generation
VozCraft processes your request instantly:
- A brief 400ms delay ensures smooth transitions
- The Web Speech API begins synthesizing audio
- The button changes to ”⏹ Detener Audio” / ”⏹ Stop Audio” during playback
Instant Playback: VozCraft uses the browser’s built-in speech synthesis for real-time playback—no file generation or server processing required during this step.
Step 5: Review Your History
Every generated audio is automatically saved to your history panel:History Features
Each history item displays:- Custom Name or Timestamp: “Audio · 1/15/2026, 3:45 PM” (default)
- Edit Icon (✏️): Click to rename the audio
- Metadata: Date, voice, voice type, speed, mood
- Text Preview: First line of the generated text
- Action Buttons:
- ▶ Reproducir / Play: Replay the audio with original settings
- MP3: Download as MP3 file
- WAV: Download as WAV file
- 📄 TXT: Download transcript with metadata
- ✕ Delete: Remove from history
Rename Your Audio
Enter a Custom Name
A modal appears with a text input. Type your desired name (up to 80 characters).Example names:
- “Product Demo Script”
- “Spanish Lesson 1”
- “Podcast Intro - Episode 5”
Play from History
Click the ”▶ Reproducir” / ”▶ Play” button on any history item to replay it:- An inline audio player appears below the item
- Visual waveform displays with 32 animated bars
- Progress bar shows current position
- Time display shows current time / total duration
- Click the progress bar to seek to a specific position
- Click the ⏸ Pause button to pause, or ▶ Play to resume
Step 6: Export Your Audio
VozCraft offers multiple export formats for your generated audio:Download MP3
Click the “MP3” button (green) to download compressed audio:- Generates a WAV file and presents it as MP3 (browser compatibility)
- Smaller file size for easy sharing
- Sample rate: 22,050 Hz
- Bit depth: 16-bit
- Filename:
vozcraft-[timestamp].mp3
Download WAV
Click the “WAV” button (amber) to download uncompressed audio:- Full-quality audio with no compression
- Larger file size, suitable for editing
- Sample rate: 22,050 Hz
- Bit depth: 16-bit
- Mono (1 channel)
- Filename:
vozcraft-[timestamp].wav
Download Transcript
Click the ”📄 TXT” button (purple) to download a formatted transcript:Export Process: When you download MP3/WAV files, VozCraft uses the Web Audio API to synthesize a complete audio file with all your custom settings applied, including pitch modifications, formant filtering, and syllable-based amplitude envelopes.
Step 7: Manage Your History
The history panel includes tools for managing your generated audio:Save History
Click ”💾 Guardar” / ”💾 Save” to export your entire history:- Downloads a JSON file:
vozcraft-historial.json - Contains all audio metadata and settings
- Preserves custom names and timestamps
- Can be imported later to restore your history
Load History
Click ”📂 Cargar” / ”📂 Load” to import a previously saved history:- Opens a file picker dialog
- Select a
.jsonhistory file - All entries are loaded and merged with current history
- Success toast shows number of entries loaded
Clear History
Click “Limpiar todo” / “Clear all” to remove all history items:- Removes all entries from the history panel
- Clears browser storage
- Cannot be undone (save history first if needed)
- Shows confirmation toast
Common Quickstart Issues
No sound is playing
No sound is playing
Solutions:
- Check your system volume and ensure it’s not muted
- Verify your browser has permission to play audio
- Try a different browser (Chrome/Edge recommended)
- Check if other websites can play audio
- Restart your browser
Voice sounds robotic or unnatural
Voice sounds robotic or unnatural
Solutions:
- Try the Neutral mood for most natural results
- Use Normal speed (1.00x) instead of faster speeds
- Select Normal Voice type instead of High-pitched
- Some languages have better voice quality than others—try English (US) or Spanish (México) for best results
Wrong language or accent is used
Wrong language or accent is used
Solutions:
- Your system may not have the selected voice installed
- Try a different regional variant (e.g., try English UK if US doesn’t work)
- Check your operating system’s TTS voice settings
- Some browsers have better voice support than others
Cannot download MP3/WAV files
Cannot download MP3/WAV files
Solutions:
- Ensure pop-ups and downloads are allowed for the site
- Check your browser’s download settings
- Verify you have sufficient disk space
- Try a different browser
- Check browser console for errors (F12)
Next Steps
Congratulations! You’ve successfully created your first text-to-speech audio with VozCraft. Here’s what to explore next:Voice Options
Explore all 22+ languages and discover regional accent differences
Customization
Master all customization options for perfect audio
Using VozCraft
Learn advanced workflows and best practices
