Using VozCraft - Complete Guide
This comprehensive guide covers complete workflows for using VozCraft effectively, from basic generation to advanced techniques for professional audio production. Whether you’re creating content for education, business, or entertainment, this guide will help you maximize VozCraft’s capabilities.Application Workflow Overview
VozCraft follows a simple, intuitive workflow:Basic Workflows
Workflow 1: Quick Audio Generation
Use Case: Generate a single audio file quickly Time Required: 2-3 minutesSelect Language
Keep Default Settings
- Voice Type: Normal
- Speed: Normal
- Mood: Neutral
Workflow 2: Creating Multiple Audio Files
Use Case: Generate a series of related audio files (e.g., course lessons, podcast episodes) Time Required: 5-10 minutes per fileConfigure Base Settings
- Language: Your content language
- Voice Type: Choose based on content style
- Speed: Consider your audience
- Mood: Match your content tone
Generate First Audio
- Paste first text
- Click Generate Audio
- Listen to verify settings
- Adjust if needed
Generate Subsequent Files
- Clear text area
- Paste next content
- Generate (settings persist)
- Rename in history
Workflow 3: Testing Different Voice Options
Use Case: Experiment with different voices to find the best fit Time Required: 10-15 minutesPrepare Test Text
- Typical vocabulary from your content
- Natural sentence structure
- Any special terms or names
Test Language Variants
- English (US)
- English (UK)
- English (AU)
- Clarity
- Accent appropriateness
- Pronunciation accuracy
Test Voice Types
- Generate with Normal Voice
- Generate with High-pitched Voice
- Which sounds more professional?
- Which matches your brand?
- Which is more engaging?
Test Moods
- Neutral (baseline)
- Happy (if content is positive)
- Serious (if content is formal)
- Energetic (if content is dynamic)
Select Winner
- Sounds most natural
- Matches your content tone
- Appeals to your audience
- Meets your quality standards
Advanced Workflows
Workflow 4: Long-Form Content Production
Use Case: Creating audiobook chapters, long articles, or extensive documentation Challenges: 5,000 character limit, maintaining consistency Solution: Split content intelligently and merge laterPrepare Your Content
- Write complete content in text editor
- Check total length (character count)
- Plan splits at natural breakpoints:
- End of paragraphs
- Section breaks
- Logical thought boundaries
numberOfSegments = totalCharacters / 4500(Use 4,500 to leave buffer for safety)Split Text Strategically
- End at sentence boundaries
- Don’t exceed 5,000 characters
- Have slight overlap for smooth merging
Generate All Segments
- Paste text into VozCraft
- Verify character count is under 5,000
- Generate audio
- Rename clearly: “Chapter_3_Segment_1”, “Chapter_3_Segment_2”, etc.
- Export as WAV (for later editing)
Export and Organize
- Export all segments as WAV
- Export history JSON as backup
- Organize files in project folder:
Workflow 5: Multilingual Content Creation
Use Case: Creating the same content in multiple languages Time Required: 5-10 minutes per languagePrepare Translations
- Use professional translation service
- Or Google Translate for basic needs
- Verify translations with native speakers if possible
Create Reference Matrix
| Language | Voice Type | Speed | Mood | Notes |
|---|---|---|---|---|
| English (US) | Normal | Normal | Neutral | Default |
| Español (México) | Normal | Normal | Neutral | Match English |
| Français | Normal | Slow | Neutral | Slower for clarity |
Generate Each Language
- Select language in VozCraft
- Paste translated text
- Adjust settings per your matrix
- Generate and listen
- Rename: “ProductDemo_EN”, “ProductDemo_ES”, etc.
Workflow 6: Voice Characterization
Use Case: Creating distinct character voices for storytelling, role-play, or educational scenarios Example: Creating a story with 3 charactersDefine Character Profiles
- Voice: Normal
- Speed: Normal
- Mood: Neutral
- Purpose: Authoritative, clear
- Voice: High-pitched
- Speed: Fast
- Mood: Happy
- Purpose: Energetic, youthful
- Voice: Normal
- Speed: Slow
- Mood: Serious
- Purpose: Thoughtful, experienced
Generate Each Voice
- Configure settings for that character
- Generate audio
- Rename with character name and sequence number
- Export as WAV for editing
Specialized Use Cases
Use Case: Language Learning Content
Goal: Create audio for language learners with optimal clarity Recommended Settings:- Speed: Slow or Very Slow (0.75x or 0.50x)
- Voice Type: Normal (clearer pronunciation)
- Mood: Neutral (no emotional distraction)
- Approach: Pronunciation practice first, comprehension practice later
Pronunciation Practice
Pronunciation Practice
- Generate with maximum clarity (Very Slow)
- Export as MP3
- Use in flashcard apps or learning materials
Comprehension Practice
Comprehension Practice
- Generate at normal conversational speed
- Test learner comprehension
- Generate slow version for review if needed
Progressive Difficulty
Progressive Difficulty
- Version A: Very Slow (0.50x) - Beginners
- Version B: Slow (0.75x) - Intermediate
- Version C: Normal (1.00x) - Advanced
- Learners progress through versions
- Builds confidence gradually
- Accommodates different skill levels
Use Case: Podcast Intro/Outro
Goal: Create professional podcast intro and outro segments Recommended Settings:- Intro: High-pitched + Fast + Enthusiastic (energetic welcome)
- Outro: Normal + Normal + Neutral (professional closing)
Generate with Appropriate Mood
- Intro: Use Enthusiastic or Happy for energy
- Outro: Use Neutral for professional close
Enhance in Audio Editor
- Add intro/outro music
- Apply compression for consistency
- Add reverb for polish (subtle)
- Normalize volume to -16 LUFS (podcast standard)
- Export final versions
Use Case: IVR / Phone System
Goal: Create phone system prompts and menus Recommended Settings:- Voice Type: Normal (professional)
- Speed: Slow (clarity over phone)
- Mood: Neutral or Serious (professional tone)
Main Menu
Main Menu
Hold Messages
Hold Messages
Confirmation Messages
Confirmation Messages
Best Practices
Text Formatting for Best Results
Use Proper Punctuation
Avoid Special Characters
Break Long Sentences
Spell Out Abbreviations
Quality Control Checklist
Before exporting final audio:Pronunciation Check
- All names pronounced correctly?
- Technical terms handled well?
- Numbers read naturally?
- Acronyms spelled or spoken correctly?
Pacing Check
- Speed appropriate for content?
- Natural pauses between sentences?
- Comfortable listening pace?
- Suitable for target audience?
Tone Check
- Mood matches content?
- Emotional tone appropriate?
- Professional/casual balance correct?
- Voice type fits brand?
Naming Conventions
Develop a consistent naming system: Format:[Project]_[Type]_[Number]_[Language]_[Version]
Examples:
- Easy to sort and find files
- Clear version tracking
- Language identification
- Professional organization
Optimization Tips
Speed Up Your Workflow
Use History as Templates
Use History as Templates
- Generate audio with perfect settings
- Name it “TEMPLATE - [Description]”
- For new content: Find template in history
- Check settings (they’re displayed)
- Configure VozCraft to match
- Generate new audio
Batch Text Preparation
Batch Text Preparation
- Write all text in one document
- Run spell check
- Format numbers and abbreviations
- Split into segments if needed
- Copy/paste efficiently through VozCraft
Keyboard Shortcuts
Keyboard Shortcuts
- Ctrl+A / Cmd+A: Select all text
- Ctrl+C / Cmd+C: Copy
- Ctrl+V / Cmd+V: Paste
- Enter: Confirm rename modal
- Esc: Cancel rename modal
Maintain Consistency Across Projects
Export Reference Audio
- Generate standard voice with sample text
- Name: “COMPANY_VOICE_STANDARD_REFERENCE”
- Export MP3 and transcript
- Share with team
- Use for comparison
