
Using VozCraft - Complete Guide

This guide covers end-to-end workflows for VozCraft, from basic generation to advanced techniques for professional audio production. Whether you’re creating content for education, business, or entertainment, it shows how to get the most from VozCraft’s capabilities.

Application Workflow Overview

VozCraft follows a simple, intuitive workflow:
1

Access VozCraft

Open the application in your browser—no login or installation required.
2

Configure Settings

Choose language, voice type, speed, and mood for your audio.
3

Enter Text

Type or paste your content (up to 5,000 characters).
4

Generate Audio

Click the generate button to create audio instantly.
5

Listen & Review

Play back the audio to verify it meets your needs.
6

Export Audio

Download as MP3, WAV, or save transcript as TXT.
7

Manage History

Rename, organize, or export your audio library.

Basic Workflows

Workflow 1: Quick Audio Generation

Use Case: Generate a single audio file quickly
Time Required: 2-3 minutes
1

Open VozCraft

Navigate to VozCraft in your web browser.
2

Select Language

Click the 🌍 Voice / Accent / Region dropdown and select your target language. Example: Choose “English (US)” for American English.
3

Keep Default Settings

For quick generation, use defaults:
  • Voice Type: Normal
  • Speed: Normal
  • Mood: Neutral
4

Enter Your Text

Type or paste your content into the text area. For a quick test:
Welcome to VozCraft. This is a test of the text-to-speech system.
5

Generate

Click “🎧 Generate Audio” and listen to the result.
6

Export if Satisfied

Click “MP3” to download your audio file.
Quick Tip: For most general content, the default settings (Normal voice, Normal speed, Neutral mood) provide excellent results.

Workflow 2: Creating Multiple Audio Files

Use Case: Generate a series of related audio files (e.g., course lessons, podcast episodes)
Time Required: 5-10 minutes per file
1

Configure Base Settings

Set up your preferred voice configuration once:
  • Language: Your content language
  • Voice Type: Choose based on content style
  • Speed: Consider your audience
  • Mood: Match your content tone
2

Generate First Audio

  1. Paste first text
  2. Click Generate Audio
  3. Listen to verify settings
  4. Adjust if needed
3

Rename in History

Click the ✏️ icon and name it descriptively:
Lesson 1: Introduction
4

Generate Subsequent Files

For each additional file:
  1. Clear text area
  2. Paste next content
  3. Generate (settings persist)
  4. Rename in history
Names like:
Lesson 2: Basic Concepts
Lesson 3: Advanced Techniques
5

Bulk Export

Once all files are generated:
  1. Export each as MP3/WAV
  2. Export history as JSON (backup)
  3. Save transcript files if needed
Settings Persistence: VozCraft remembers your last-used settings, making it easy to generate multiple files with consistent voice characteristics.

Workflow 3: Testing Different Voice Options

Use Case: Experiment with different voices to find the best fit
Time Required: 10-15 minutes
1

Prepare Test Text

Write a representative sample (100-200 characters) that includes:
  • Typical vocabulary from your content
  • Natural sentence structure
  • Any special terms or names
Example:
Welcome to our product demo. Today we'll explore the key features
that make our solution unique. Let's get started with the basics.
2

Test Language Variants

Generate audio with different accents:
  1. English (US)
  2. English (UK)
  3. English (AU)
Listen to each and note:
  • Clarity
  • Accent appropriateness
  • Pronunciation accuracy
3

Test Voice Types

Using your preferred language:
  1. Generate with Normal Voice
  2. Generate with High-pitched Voice
Compare:
  • Which sounds more professional?
  • Which matches your brand?
  • Which is more engaging?
4

Test Moods

Try 3-4 relevant moods:
  • Neutral (baseline)
  • Happy (if content is positive)
  • Serious (if content is formal)
  • Energetic (if content is dynamic)
5

Select Winner

Choose the combination that:
  • Sounds most natural
  • Matches your content tone
  • Appeals to your audience
  • Meets your quality standards
6

Document Settings

Note your final settings:
  • Language: _______
  • Voice Type: _______
  • Speed: _______
  • Mood: _______
Use these for all future content.

Advanced Workflows

Workflow 4: Long-Form Content Production

Use Case: Creating audiobook chapters, long articles, or extensive documentation
Challenges: 5,000 character limit, maintaining consistency
Solution: Split content intelligently and merge later
1

Prepare Your Content

  1. Write complete content in text editor
  2. Check total length (character count)
  3. Plan splits at natural breakpoints:
    • End of paragraphs
    • Section breaks
    • Logical thought boundaries
Formula: numberOfSegments = ceil(totalCharacters / 4500), rounded up because a partial segment still needs its own generation. (Use 4,500 rather than 5,000 to leave a safety buffer.)
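As a worked instance of the formula above: the division usually leaves a remainder, so the result is rounded up. This is a minimal Python sketch; the function name and the sample length are illustrative and not part of VozCraft.

```python
import math

SEGMENT_BUDGET = 4500  # characters per segment, leaving headroom under the 5,000 limit


def estimate_segments(total_characters: int, budget: int = SEGMENT_BUDGET) -> int:
    """Return how many segments a text of the given length needs."""
    if total_characters <= 0:
        return 0
    return math.ceil(total_characters / budget)


print(estimate_segments(12_000))  # 12,000 / 4,500 = 2.67, rounded up to 3
```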
2

Split Text Strategically

Create segments that:
  • End at sentence boundaries
  • Don’t exceed 5,000 characters
  • Have slight overlap for smooth merging
Example split points:
Segment 1: [0-4500] chars - ends at "...and that concludes section one."
Segment 2: [4480-8980] chars - starts 20 chars before end of Seg 1
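The splitting step can be prepared outside VozCraft before pasting. The sketch below packs whole sentences into segments of at most 4,500 characters. It is a simplification under stated assumptions: sentences are detected with a rough punctuation heuristic, the function name is hypothetical, and it does not add the 20-character overlap shown in the example above.

```python
import re


def split_into_segments(text: str, budget: int = 4500) -> list[str]:
    """Greedily pack whole sentences into segments of at most `budget` characters."""
    # Split after ., ! or ? followed by whitespace (a heuristic, not a full tokenizer).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    segments: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= budget:
            current = candidate
        else:
            if current:
                segments.append(current)
            # A single sentence longer than the budget is kept whole;
            # it has to be shortened by hand before generating.
            current = sentence
    if current:
        segments.append(current)
    return segments
```

Each returned segment ends at a sentence boundary, so it can be pasted into the text area as-is once its length is confirmed to be under the limit.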
3

Generate All Segments

For each segment:
  1. Paste text into VozCraft
  2. Verify character count is under 5,000
  3. Generate audio
  4. Rename clearly: “Chapter_3_Segment_1”, “Chapter_3_Segment_2”, etc.
  5. Export as WAV (for later editing)
4

Export and Organize

  1. Export all segments as WAV
  2. Export history JSON as backup
  3. Organize files in project folder:
Project/
├── Chapter_3_Segment_1.wav
├── Chapter_3_Segment_2.wav
├── Chapter_3_Segment_3.wav
└── vozcraft-history.json
5

Merge in Audio Editor (Optional)

Using Audacity or similar:
  1. Import all segments
  2. Arrange on timeline
  3. Add slight crossfade between segments (1-2 seconds)
  4. Normalize volume across all segments
  5. Export final combined file
Consistency is Key: Use identical settings (voice, speed, mood) for all segments to ensure seamless transitions.
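If a crossfade is not needed, the exported segments can also be joined with a short script instead of an audio editor. The sketch below uses Python's standard `wave` module to append files end to end. It assumes uncompressed PCM WAV exports that share the same channel count, sample width, and sample rate; it does not crossfade or normalize volume, and the function name is hypothetical.

```python
import wave
from pathlib import Path


def concatenate_wavs(inputs: list[Path], output: Path) -> None:
    """Join WAV files end to end; all inputs must share the same audio format."""
    if not inputs:
        raise ValueError("no input files given")
    with wave.open(str(inputs[0]), "rb") as first:
        channels = first.getnchannels()
        sample_width = first.getsampwidth()
        frame_rate = first.getframerate()
    with wave.open(str(output), "wb") as out:
        out.setnchannels(channels)
        out.setsampwidth(sample_width)
        out.setframerate(frame_rate)
        for path in inputs:
            with wave.open(str(path), "rb") as segment:
                fmt = (segment.getnchannels(), segment.getsampwidth(), segment.getframerate())
                if fmt != (channels, sample_width, frame_rate):
                    raise ValueError(f"{path.name} has a different audio format")
                # Copy every frame of this segment onto the end of the output.
                out.writeframes(segment.readframes(segment.getnframes()))
```

Pass the segment paths in playback order. Note that plain alphabetical sorting places "Segment_10" before "Segment_2", so zero-padded numbers (01, 02, ...) are safer once a chapter has ten or more segments.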

Workflow 5: Multilingual Content Creation

Use Case: Creating the same content in multiple languages
Time Required: 5-10 minutes per language
1

Prepare Translations

Translate your content into target languages:
  • Use professional translation service
  • Or Google Translate for basic needs
  • Verify translations with native speakers if possible
2

Create Reference Matrix

Document your plan:
Language          Voice Type  Speed   Mood     Notes
English (US)      Normal      Normal  Neutral  Default
Español (México)  Normal      Normal  Neutral  Match English
Français          Normal      Slow    Neutral  Slower for clarity
3

Generate Each Language

For each language:
  1. Select language in VozCraft
  2. Paste translated text
  3. Adjust settings per your matrix
  4. Generate and listen
  5. Rename: “ProductDemo_EN”, “ProductDemo_ES”, etc.
4

Export Organized Files

Structure your exports:
ProductDemo/
├── EN/
│   ├── ProductDemo_EN.mp3
│   └── ProductDemo_EN.txt
├── ES/
│   ├── ProductDemo_ES.mp3
│   └── ProductDemo_ES.txt
└── FR/
    ├── ProductDemo_FR.mp3
    └── ProductDemo_FR.txt
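The folder layout above can be created ahead of time so each download has an obvious destination. This is a minimal sketch using `pathlib`; the function name and its parameters are illustrative, and the script only creates folders and lists the expected file paths. The audio and transcript files still come from VozCraft's export buttons.

```python
from pathlib import Path


def plan_export_paths(
    project: str, languages: list[str], root: Path = Path(".")
) -> dict[str, list[Path]]:
    """Create <project>/<LANG>/ folders and return the expected MP3 and TXT paths."""
    plan: dict[str, list[Path]] = {}
    for code in languages:
        folder = root / project / code
        folder.mkdir(parents=True, exist_ok=True)
        stem = f"{project}_{code}"
        plan[code] = [folder / f"{stem}.mp3", folder / f"{stem}.txt"]
    return plan
```

For example, `plan_export_paths("ProductDemo", ["EN", "ES", "FR"])` creates the three language folders under the current directory and returns the two expected file paths for each.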
5

Quality Check

Have native speakers review:
  • Pronunciation accuracy
  • Naturalness
  • Appropriate voice characteristics
  • Cultural appropriateness

Workflow 6: Voice Characterization

Use Case: Creating distinct character voices for storytelling, role-play, or educational scenarios
Example: Creating a story with 3 characters
1

Define Character Profiles

Plan voice characteristics:
Character 1: Narrator
  • Voice: Normal
  • Speed: Normal
  • Mood: Neutral
  • Purpose: Authoritative, clear
Character 2: Young Character
  • Voice: High-pitched
  • Speed: Fast
  • Mood: Happy
  • Purpose: Energetic, youthful
Character 3: Wise Elder
  • Voice: Normal
  • Speed: Slow
  • Mood: Serious
  • Purpose: Thoughtful, experienced
2

Separate Dialogue

Split your script by speaker:
Narrator_01.txt: "Once upon a time, in a faraway land..."
Character2_01.txt: "Wow! Look at that amazing castle!"
Character3_01.txt: "Patience, young one. All in good time."
Narrator_02.txt: "And so the adventure began..."
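For longer scripts, the per-speaker split can be automated. The sketch below assumes the script is written one line per utterance in the form `Speaker: text`, which is an assumed input format and not something VozCraft requires. It numbers each speaker's lines separately, matching the file names shown above; the function name is hypothetical.

```python
from collections import defaultdict


def split_by_speaker(script: str) -> list[tuple[str, str]]:
    """Turn 'Speaker: line' rows into (filename, text) pairs numbered per speaker."""
    counts: dict[str, int] = defaultdict(int)
    parts: list[tuple[str, str]] = []
    for row in script.splitlines():
        if ":" not in row:
            continue  # skip blank lines and rows without a speaker label
        speaker, text = row.split(":", 1)
        speaker = speaker.strip().replace(" ", "")
        counts[speaker] += 1
        parts.append((f"{speaker}_{counts[speaker]:02d}.txt", text.strip()))
    return parts
```

The pairs come back in story order, which is also the order to arrange the generated audio on the editor timeline.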
3

Generate Each Voice

For each text segment:
  1. Configure settings for that character
  2. Generate audio
  3. Rename with character name and sequence number
  4. Export as WAV for editing
4

Assemble in Audio Editor

Using Audacity:
  1. Import all character audio files
  2. Arrange in story sequence on timeline
  3. Add pauses between speakers (0.5-1 second)
  4. Add background music or sound effects
  5. Export final story
Voice Contrast: Maximize differences between characters by using contrasting settings (High-pitched vs Normal, Fast vs Slow) for clear distinction.

Specialized Use Cases

Use Case: Language Learning Content

Goal: Create audio for language learners with optimal clarity
Recommended Settings:
  • Speed: Slow or Very Slow (0.75x or 0.50x)
  • Voice Type: Normal (clearer pronunciation)
  • Mood: Neutral (no emotional distraction)
  • Approach: Pronunciation practice first, comprehension practice later
Workflow:
Settings: Very Slow + Normal + Neutral
Content Format:
Word. Pause. Sentence.

Apple. (pause) I have an apple.
Book. (pause) She is reading a book.
Process:
  1. Generate with maximum clarity (Very Slow)
  2. Export as MP3
  3. Use in flashcard apps or learning materials
Benefit: Learners hear every sound clearly
Settings: Normal + Normal + Neutral
Content Format:
Natural sentences and paragraphs at conversational speed.

The weather today is sunny and warm. It's a perfect day
for a walk in the park.
Process:
  1. Generate at normal conversational speed
  2. Test learner comprehension
  3. Generate slow version for review if needed
Benefit: Realistic listening practice
Create 3 Versions:
  1. Version A: Very Slow (0.50x) - Beginners
  2. Version B: Slow (0.75x) - Intermediate
  3. Version C: Normal (1.00x) - Advanced
Same Content, Different Speeds:
  • Learners progress through versions
  • Builds confidence gradually
  • Accommodates different skill levels

Use Case: Podcast Intro/Outro

Goal: Create professional podcast intro and outro segments
Recommended Settings:
  • Intro: High-pitched + Fast + Enthusiastic (energetic welcome)
  • Outro: Normal + Normal + Neutral (professional closing)
Workflow:
1

Write Scripts

Intro Script (30-45 seconds):
Welcome to [Podcast Name]! The show where we explore [topic].
I'm your host [Name], and today we're diving into [episode topic].
Let's get started!
Outro Script (20-30 seconds):
Thanks for listening to [Podcast Name]. If you enjoyed this episode,
please subscribe and leave a review. Until next time!
2

Generate with Appropriate Mood

  • Intro: Use Enthusiastic or Happy for energy
  • Outro: Use Neutral for professional close
3

Export as WAV

Export high-quality WAV files for podcast production
4

Enhance in Audio Editor

  1. Add intro/outro music
  2. Apply compression for consistency
  3. Add reverb for polish (subtle)
  4. Normalize volume to -16 LUFS (podcast standard)
  5. Export final versions
5

Reuse for Every Episode

Use the same intro/outro files across episodes for brand consistency

Use Case: IVR / Phone System

Goal: Create phone system prompts and menus
Recommended Settings:
  • Voice Type: Normal (professional)
  • Speed: Slow (clarity over phone)
  • Mood: Neutral or Serious (professional tone)
Workflow:
Thank you for holding. Your call is important to us.
A representative will be with you shortly.
Settings: Normal + Normal + Neutral
Tip: Keep hold messages calm and professional
Your request has been submitted successfully.
You will receive a confirmation email within 24 hours.
Thank you for your business.
Settings: Normal + Normal + Neutral
Export: WAV format for best phone system compatibility

Best Practices

Text Formatting for Best Results

Use Proper Punctuation

Good:
Hello! Welcome to our store. How can I help you today?
Bad:
hello welcome to our store how can i help you today
Impact: Punctuation creates natural pauses and intonation

Avoid Special Characters

Good:
The price is twenty dollars.
Call us at 5-5-5, 1-2-3-4.
Bad:
The price is $20.
Call us at 555-1234.
Impact: Write numbers and symbols as words for better pronunciation
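Rewriting prices and phone numbers by hand gets tedious in long scripts, so a small pre-processing pass can help. The sketch below handles only whole-dollar amounts from 0 to 999 and digit groups in phone numbers; the function names are hypothetical, and anything outside that range (cents, thousands separators) is deliberately left untouched for manual editing.

```python
import re

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
        "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
        "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer from 0 to 999 in English."""
    if not 0 <= n <= 999:
        raise ValueError("only 0-999 is supported in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + (f"-{ONES[ones]}" if ones else "")
    hundreds, rest = divmod(n, 100)
    return f"{ONES[hundreds]} hundred" + (f" {number_to_words(rest)}" if rest else "")


def spell_out_prices(text: str) -> str:
    """Replace whole-dollar amounts such as $20 with words."""
    def replace(match: re.Match) -> str:
        amount = int(match.group(1))
        unit = "dollar" if amount == 1 else "dollars"
        return f"{number_to_words(amount)} {unit}"
    # The lookahead skips amounts with more digits, cents, or separators.
    return re.sub(r"\$(\d{1,3})(?![\d.,]*\d)", replace, text)


def spell_out_phone(number: str) -> str:
    """Separate each digit so it is read individually, e.g. 555-1234."""
    groups = re.findall(r"\d+", number)
    return ", ".join("-".join(group) for group in groups)


print(spell_out_prices("The price is $20."))  # The price is twenty dollars.
print(spell_out_phone("555-1234"))            # 5-5-5, 1-2-3-4
```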

Break Long Sentences

Good:
We offer many services. These include consulting, training, and support.
Each service is customized to your needs.
Bad:
We offer many services including consulting training and support and
each service is customized to your specific needs and requirements.
Impact: Shorter sentences improve clarity and flow
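To find sentences worth breaking up before generating, a quick word count per sentence is usually enough. In this sketch the 15-word threshold is an arbitrary assumption and not a VozCraft rule, and the function name is hypothetical; adjust the limit to suit your content.

```python
import re


def long_sentences(text: str, max_words: int = 15) -> list[str]:
    """Return the sentences that contain more than `max_words` words."""
    # Split after ., ! or ? followed by whitespace (a rough heuristic).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return [s for s in sentences if len(s.split()) > max_words]
```

Run against the two examples above, the "Bad" text is flagged as one 20-word sentence, while none of the three sentences in the "Good" text is flagged.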

Spell Out Abbreviations

Good:
The United States of America.
Doctor Smith will see you.
Bad:
The USA.
Dr. Smith will see you.
Impact: Spelling out ensures correct pronunciation
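Abbreviations can be expanded with a lookup table before the text is pasted in. The table below is a hypothetical starting point containing only the two examples above; extend it for your own content, and review the output, since some abbreviations are ambiguous (for instance, "Dr." can also mean "Drive").

```python
import re

# Illustrative table; add the abbreviations your scripts actually use.
ABBREVIATIONS = {
    "Dr.": "Doctor",
    "USA": "United States of America",
}


def expand_abbreviations(text: str, table: dict[str, str] = ABBREVIATIONS) -> str:
    """Replace each abbreviation in `table` with its spelled-out form."""
    for short, full in table.items():
        # Match the abbreviation only as a standalone token, not inside a longer word.
        pattern = r"(?<!\w)" + re.escape(short) + r"(?!\w)"
        text = re.sub(pattern, lambda _match, full=full: full, text)
    return text


print(expand_abbreviations("Dr. Smith will see you."))  # Doctor Smith will see you.
```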

Quality Control Checklist

Before exporting final audio:
1

Pronunciation Check

  • All names pronounced correctly?
  • Technical terms handled well?
  • Numbers read naturally?
  • Acronyms spelled or spoken correctly?
2

Pacing Check

  • Speed appropriate for content?
  • Natural pauses between sentences?
  • Comfortable listening pace?
  • Suitable for target audience?
3

Tone Check

  • Mood matches content?
  • Emotional tone appropriate?
  • Professional/casual balance correct?
  • Voice type fits brand?
4

Technical Check

  • Audio plays smoothly?
  • No cuts or glitches?
  • Volume consistent?
  • Exported in correct format?

Naming Conventions

Develop a consistent naming system:
Format: [Project]_[Type]_[Number]_[Language]_[Version]
Examples:
Podcast_Intro_01_EN_v1
Course_Lesson_05_ES_v2
Product_Demo_Main_FR_final
IVR_Menu_Main_EN_v3
Benefits:
  • Easy to sort and find files
  • Clear version tracking
  • Language identification
  • Professional organization

Optimization Tips

Speed Up Your Workflow

Instead of configuring settings repeatedly:
  1. Generate audio with perfect settings
  2. Name it “TEMPLATE - [Description]”
  3. For new content: Find template in history
  4. Check settings (they’re displayed)
  5. Configure VozCraft to match
  6. Generate new audio
Saves: 1-2 minutes per generation
Prepare all content before starting:
  1. Write all text in one document
  2. Run spell check
  3. Format numbers and abbreviations
  4. Split into segments if needed
  5. Copy/paste efficiently through VozCraft
Saves: Reduces context switching, improves focus
Use browser shortcuts:
  • Ctrl+A / Cmd+A: Select all text
  • Ctrl+C / Cmd+C: Copy
  • Ctrl+V / Cmd+V: Paste
  • Enter: Confirm rename modal
  • Esc: Cancel rename modal
Saves: Seconds per action, minutes over session

Maintain Consistency Across Projects

1

Document Your Standards

Create a voice style guide:
Company Voice Standards:
- Language: English (US)
- Voice Type: Normal
- Speed: Normal
- Mood: Neutral (default) or Serious (important announcements)
- Exceptions: Marketing uses Happy mood
2

Export Reference Audio

Create a reference file:
  1. Generate standard voice with sample text
  2. Name: “COMPANY_VOICE_STANDARD_REFERENCE”
  3. Export MP3 and transcript
  4. Share with team
  5. Use for comparison
3

Regular Quality Reviews

Schedule periodic reviews:
  • Compare new audio to reference
  • Ensure settings haven’t drifted
  • Update standards if needed
  • Train team members on standards

Troubleshooting Common Workflows

Problem: Generated audio sounds unnatural
Solutions:
  1. Check punctuation: Add periods, commas, and question marks
  2. Simplify text: Break long sentences into shorter ones
  3. Try different mood: Switch to Neutral if using extreme moods
  4. Adjust speed: Normal speed often sounds most natural
  5. Test different language variant: Try another accent
Problem: Can’t generate consistent results
Solutions:
  1. Document settings: Write down exact settings used
  2. Export history JSON: Backup contains all settings
  3. Use history: Reference previous successful generations
  4. Check browser: Use the same browser for every generation to keep results consistent
  5. Name systematically: Include settings in filename
Problem: Taking too long to produce content
Solutions:
  1. Batch preparation: Prepare all text before starting
  2. Use templates: Reference previous good settings
  3. Skip perfection: “Good enough” may be sufficient
  4. Learn shortcuts: Use keyboard navigation
  5. Optimize workflow: Follow workflows in this guide

Next Steps

Voice Settings Guide

Detailed guide for optimal voice configuration

Exporting Audio

Step-by-step guide for all export scenarios

Troubleshooting

Solutions to common problems
