Quickstart Guide

This guide will walk you through creating your first text-to-speech audio with VozCraft. In just a few steps, you’ll be generating professional-quality audio in any of our 22+ supported languages.

Estimated Time: 5 minutes to generate your first audio

Prerequisites

Before you begin, ensure you have:

A modern web browser (Chrome, Edge, Safari, or Opera recommended)
An active internet connection (only for initial page load)
JavaScript enabled in your browser
Basic speakers or headphones to hear the generated audio

VozCraft runs entirely in your browser—no account creation or software installation required!

Step 1: Access VozCraft

Navigate to the VozCraft application in your web browser. The application loads instantly and presents you with the main interface:

Open the Application

The VozCraft interface displays with a clean, intuitive layout featuring:

Navigation bar with theme toggle and language selector
Main control panel on the left
History panel on the right (desktop) or below (mobile)

Choose Your Theme

Click the ”🌙 Oscuro” / ”☀ Claro” button in the top-right to toggle between dark and light modes. The interface adapts instantly to your preference.

Dark Mode: Optimal for low-light environments
Light Mode: Better for bright conditions

Select Your Language

Click the ”🇪🇸 ES” / ”🇺🇸 EN” button to switch the interface language between Spanish and English. All labels, buttons, and messages update immediately.

Step 2: Configure Voice Settings

The control panel offers four key settings that shape your audio output. Let’s configure each one:

🌍 Voice / Accent / Region

Select your desired language and regional accent from the dropdown menu:

Spanish
English
Other Languages

Choose from 6 Spanish variants:

Español (México) 🇲🇽 - Mexican Spanish
Español (España) 🇪🇸 - Castilian Spanish
Español (Argentina) 🇦🇷 - Argentine Spanish
Español (Colombia) 🇨🇴 - Colombian Spanish
Español (Chile) 🇨🇱 - Chilean Spanish
Español (Venezuela) 🇻🇪 - Venezuelan Spanish

First-time users: Start with your native language to familiarize yourself with the interface, then experiment with other languages.

🎭 Voice Type

Choose between two voice characteristics:

🔉 Normal Voice: Standard pitch (0.75) - Balanced, natural-sounding voice suitable for most content
🔊 High-pitched Voice: Elevated pitch (1.30) - Lighter, more energetic tone ideal for upbeat content

Click either button to toggle between the two options. The selected option displays with a blue gradient background.

⚡ Speed

Control the playback speed with five options:

Speed	Rate	Best For
Muy Lento / Very Slow	0.50x	Learning, careful listening
Lento / Slow	0.75x	Comprehension, note-taking
Normal	1.00x	Standard content, default setting
Rápido / Fast	1.25x	Quick reviews, experienced listeners
Muy Rápido / Very Fast	1.60x	Rapid consumption, time-saving

The default Normal speed (1.00x) provides the most natural-sounding results for first-time users.

💫 Mood

Shape the emotional character of your audio with eight mood presets:

😐 Neutral

Balanced expression

Pitch: 1.00
Rate: 1.00x
Volume: 100%
Best for: Professional content, news, documentation

😄 Happy / Alegre

High and lively tone

Pitch: 1.25
Rate: 1.15x
Volume: 100%
Best for: Marketing, children’s content, celebrations

😠 Serious / Serio

Deep, steady and firm

Pitch: 0.80
Rate: 0.88x
Volume: 95%
Best for: Formal announcements, serious topics

🤩 Enthusiastic / Entusiasta

Very energetic and expressive

Pitch: 1.35
Rate: 1.25x
Volume: 100%
Best for: Motivational content, sports, excitement

😔 Melancholic / Melancólico

Soft, slow and nostalgic

Pitch: 0.70
Rate: 0.78x
Volume: 88%
Best for: Poetry, reflective content, memorials

⚡ Energetic / Enérgico

Fast, dynamic and powerful

Pitch: 1.15
Rate: 1.30x
Volume: 100%
Best for: Action content, workouts, urgent messages

😌 Relaxed / Relajado

Calm and slow-paced

Pitch: 0.88
Rate: 0.82x
Volume: 90%
Best for: Meditation, bedtime stories, ASMR

😤 Tense / Tenso

Urgent and tense

Pitch: 1.10
Rate: 1.18x
Volume: 95%
Best for: Thrillers, dramatic narration, alerts

Mood Visualization

After selecting a mood, VozCraft displays a visual indicator showing the mood’s impact on three parameters:

Pitch/Tono: How high or low the voice sounds
Rate/Velocidad: How fast the voice speaks
Volume/Volumen: The audio output level

Each parameter shows a progress bar indicating its relative intensity from 0-100%.

Step 3: Enter Your Text

In the large text area, type or paste the content you want to convert to speech:

Input Your Content

Click inside the text area and enter your text. You can:

Type directly into the field
Paste content from your clipboard (Ctrl+V / Cmd+V)
Edit and refine your text before generation

Monitor Character Count

The bottom-right corner displays your character count: X/5000

Maximum: 5,000 characters per generation
Counter turns red when exceeding 4,500 characters
Split longer content into multiple generations if needed

Character Limit: VozCraft supports up to 5,000 characters per audio generation. For longer documents, split your content into logical sections.

Example Text (Spanish)

Bienvenido a VozCraft, la aplicación de texto a voz más avanzada. 
Con VozCraft puedes generar audio profesional en más de veinte idiomas 
con control total sobre velocidad, tono y estado de ánimo.

Example Text (English)

Welcome to VozCraft, the most advanced text-to-speech application. 
With VozCraft you can generate professional audio in over twenty languages 
with complete control over speed, pitch, and mood.

Step 4: Generate Audio

Once your settings are configured and text is entered:

Click Generate Audio

Press the large ”🎧 Generar Audio” / ”🎧 Generate Audio” button at the bottom of the control panel.The button:

Has a red gradient background
Animates on hover (lifts slightly)
Changes to purple with ”⏳ Generando…” / ”⏳ Generating…” during processing

Wait for Generation

VozCraft processes your request instantly:

A brief 400ms delay ensures smooth transitions
The Web Speech API begins synthesizing audio
The button changes to ”⏹ Detener Audio” / ”⏹ Stop Audio” during playback

Listen to Your Audio

The audio begins playing automatically through your default audio output.During playback:

The generate button becomes a stop button
A success toast notification appears: ”✓ Audio generado correctamente”
The audio is automatically added to your history

Instant Playback: VozCraft uses the browser’s built-in speech synthesis for real-time playback—no file generation or server processing required during this step.

Step 5: Review Your History

Every generated audio is automatically saved to your history panel:

History Features

Each history item displays:

Custom Name or Timestamp: “Audio · 1/15/2026, 3:45 PM” (default)
Edit Icon (✏️): Click to rename the audio
Metadata: Date, voice, voice type, speed, mood
Text Preview: First line of the generated text
Action Buttons:
- ▶ Reproducir / Play: Replay the audio with original settings
- MP3: Download as MP3 file
- WAV: Download as WAV file
- 📄 TXT: Download transcript with metadata
- ✕ Delete: Remove from history

Rename Your Audio

Click the Edit Icon

Click the pencil icon (✏️) next to the audio name in the history item.

Enter a Custom Name

A modal appears with a text input. Type your desired name (up to 80 characters).Example names:

“Product Demo Script”
“Spanish Lesson 1”
“Podcast Intro - Episode 5”

Save or Cancel

Click “Guardar” / “Save” to confirm
Click “Cancelar” / “Cancel” or press ESC to abort
Press ENTER in the input field to quick-save

Play from History

Click the ”▶ Reproducir” / ”▶ Play” button on any history item to replay it:

An inline audio player appears below the item
Visual waveform displays with 32 animated bars
Progress bar shows current position
Time display shows current time / total duration
Click the progress bar to seek to a specific position
Click the ⏸ Pause button to pause, or ▶ Play to resume

Pro Tip: The audio player calculates duration based on text length and settings, providing accurate time estimates even before export.

Step 6: Export Your Audio

VozCraft offers multiple export formats for your generated audio:

Download MP3

Click the “MP3” button (green) to download compressed audio:

Generates a WAV file and presents it as MP3 (browser compatibility)
Smaller file size for easy sharing
Sample rate: 22,050 Hz
Bit depth: 16-bit
Filename: vozcraft-[timestamp].mp3

Download WAV

Click the “WAV” button (amber) to download uncompressed audio:

Full-quality audio with no compression
Larger file size, suitable for editing
Sample rate: 22,050 Hz
Bit depth: 16-bit
Mono (1 channel)
Filename: vozcraft-[timestamp].wav

Download Transcript

Click the ”📄 TXT” button (purple) to download a formatted transcript:

VozCraft — Transcripción de Audio
══════════════════════════════════════════════════
Nombre:     My Audio Name
Fecha:      15 de enero de 2026
Hora:       3:45:00 p. m.
Idioma:     Español (México)
Género:     Voz Normal
Velocidad:  Normal
Ánimo:      Alegre
══════════════════════════════════════════════════
TRANSCRIPCIÓN:

[Your full text content here]

══════════════════════════════════════════════════
© VozCraft · mateoRiosdev · 2026

Export Process: When you download MP3/WAV files, VozCraft uses the Web Audio API to synthesize a complete audio file with all your custom settings applied, including pitch modifications, formant filtering, and syllable-based amplitude envelopes.

Step 7: Manage Your History

The history panel includes tools for managing your generated audio:

Save History

Click ”💾 Guardar” / ”💾 Save” to export your entire history:

Downloads a JSON file: vozcraft-historial.json
Contains all audio metadata and settings
Preserves custom names and timestamps
Can be imported later to restore your history

Load History

Click ”📂 Cargar” / ”📂 Load” to import a previously saved history:

Opens a file picker dialog
Select a .json history file
All entries are loaded and merged with current history
Success toast shows number of entries loaded

Clear History

Click “Limpiar todo” / “Clear all” to remove all history items:

Removes all entries from the history panel
Clears browser storage
Cannot be undone (save history first if needed)
Shows confirmation toast

Data Persistence: History is stored in your browser’s local storage. Clearing browser data or using incognito mode will prevent history from persisting between sessions.

Common Quickstart Issues

No sound is playing

Solutions:

Check your system volume and ensure it’s not muted
Verify your browser has permission to play audio
Try a different browser (Chrome/Edge recommended)
Check if other websites can play audio
Restart your browser

Voice sounds robotic or unnatural

Solutions:

Try the Neutral mood for most natural results
Use Normal speed (1.00x) instead of faster speeds
Select Normal Voice type instead of High-pitched
Some languages have better voice quality than others—try English (US) or Spanish (México) for best results

Wrong language or accent is used

Solutions:

Your system may not have the selected voice installed
Try a different regional variant (e.g., try English UK if US doesn’t work)
Check your operating system’s TTS voice settings
Some browsers have better voice support than others

Cannot download MP3/WAV files

Solutions:

Ensure pop-ups and downloads are allowed for the site
Check your browser’s download settings
Verify you have sufficient disk space
Try a different browser
Check browser console for errors (F12)

Next Steps

Congratulations! You’ve successfully created your first text-to-speech audio with VozCraft. Here’s what to explore next:

Voice Options

Explore all 22+ languages and discover regional accent differences

Customization

Master all customization options for perfect audio

Using VozCraft

Learn advanced workflows and best practices

Pro Workflow: Try creating a “template” by generating audio with your preferred settings, then use those same settings for future content by selecting that history item and modifying only the text.

Get Started

Features

Guides

​Quickstart Guide

​Prerequisites

​Step 1: Access VozCraft

​Step 2: Configure Voice Settings

​🌍 Voice / Accent / Region

​🎭 Voice Type

​⚡ Speed

​💫 Mood

😐 Neutral

😄 Happy / Alegre

😠 Serious / Serio

🤩 Enthusiastic / Entusiasta

😔 Melancholic / Melancólico

⚡ Energetic / Enérgico

😌 Relaxed / Relajado

😤 Tense / Tenso

​Mood Visualization

​Step 3: Enter Your Text

​Example Text (Spanish)

​Example Text (English)

​Step 4: Generate Audio

​Step 5: Review Your History

​History Features

​Rename Your Audio

​Play from History

​Step 6: Export Your Audio

​Download MP3

​Download WAV

​Download Transcript

​Step 7: Manage Your History

​Save History

​Load History

​Clear History

​Common Quickstart Issues

​Next Steps

Voice Options

Customization

Using VozCraft

Build docs developers (and LLMs) love

Quickstart Guide

Prerequisites

Step 1: Access VozCraft

Step 2: Configure Voice Settings

🌍 Voice / Accent / Region

🎭 Voice Type

⚡ Speed

💫 Mood

Mood Visualization

Step 3: Enter Your Text

Example Text (Spanish)

Example Text (English)

Step 4: Generate Audio

Step 5: Review Your History

History Features

Rename Your Audio

Play from History

Step 6: Export Your Audio

Download MP3

Download WAV

Download Transcript

Step 7: Manage Your History

Save History

Load History

Clear History

Common Quickstart Issues

Next Steps