ElevenLabs Integration

Overview

ElevenLabs generates voice narration for the Dream Foundry winner showcase. After selecting a winning agent, ElevenLabs creates a professional audio announcement that summarizes the competition and presents the results.

ElevenLabs integration is optional. If not configured, Dream Foundry completes without audio generation.

Why Voice Narration?

The Dream Awakening phase (Phase 5) presents the winning agent to the world. Voice narration adds:

Professionalism: Polished presentation for stakeholders
Accessibility: Audio format for different consumption preferences
Marketing: Shareable content for demos and showcases
Documentation: Permanent record of competition results

Setup and Configuration

Create ElevenLabs Account

Get API Key

Navigate to Profile → API Keys and generate a new key.

Clone Your Voice (Optional)

For personalized narration, clone your voice:

Go to Voice Lab → Add Voice
Upload 1-2 minutes of clean audio samples
Wait for processing (~10 minutes)
Copy the voice ID (looks like EXAVITQu4vr4xnSDxMaL)

Configure Environment

Add credentials to .env:

ELEVENLABS_API_KEY=your_api_key_here
ELEVENLABS_CLONED_VOICE_ID=your_voice_id_here  # Optional

How It Works

Script Generation

ElevenLabs integration starts by generating a narration script:

src/elevenlabs.py

def generate_winner_script(
    winner_name: str,
    winner_score: float,
    objective: str,
) -> str:
    """Generate the narration script for the winner announcement."""
    return f"""The winner is {winner_name} with {winner_score:.1f} points.

After competing in the arena, {winner_name} emerged as the clear choice for the objective: {objective}.

The winning code has been published and is ready for review. Another dream successfully forged."""

Audio Generation

The script is converted to speech using ElevenLabs API:

src/elevenlabs.py

def generate_audio(
    text: str,
    output_path: Optional[str] = None,
    voice_id: Optional[str] = None,
) -> Optional[bytes]:
    """Generate audio from text using ElevenLabs API."""
    api_key, default_voice_id = get_elevenlabs_config()
    
    if not api_key:
        return None
    
    # Use provided voice_id, or env var, or ElevenLabs default "Rachel"
    voice = voice_id or default_voice_id or "EXAVITQu4vr4xnSDxMaL"
    
    response = requests.post(
        f"https://api.elevenlabs.io/v1/text-to-speech/{voice}",
        headers={
            "xi-api-key": api_key,
            "Content-Type": "application/json",
        },
        json={
            "text": text,
            "model_id": "eleven_monolingual_v1",
            "voice_settings": {
                "stability": 0.5,
                "similarity_boost": 0.75,
            }
        },
        timeout=30,
    )
    
    if response.status_code != 200:
        print(f"[ElevenLabs] Error: {response.status_code}")
        return None
    
    return response.content

Voice Settings

The voice generation uses these settings:

Setting	Value	Effect
stability	0.5	Balance between consistency and expressiveness
similarity_boost	0.75	Higher = closer to original voice (for clones)
model	eleven_monolingual_v1	English-only model, high quality

For multilingual support, use eleven_multilingual_v2 model instead.

Usage Examples

Basic Usage

Generate audio for the winner:

from src.elevenlabs import generate_winner_audio

audio_bytes = generate_winner_audio(
    winner_name="Agent Gamma (The Insider)",
    winner_score=94.4,
    objective="Generate weekly AI events for Discord",
    output_path="artifacts/winner_announcement.mp3",
)

if audio_bytes:
    print("Audio generated successfully!")
else:
    print("ElevenLabs not configured or request failed")

Custom Script

Generate audio from a custom script:

from src.elevenlabs import generate_audio

script = """
Welcome to the Dream Foundry results!

Today, five AI agents competed to generate the perfect Discord post.
After rigorous evaluation, we have a winner.

Congratulations to Agent Gamma!
"""

audio = generate_audio(
    text=script,
    output_path="artifacts/custom_announcement.mp3",
)

Using a Specific Voice

Use a different voice for narration:

from src.elevenlabs import generate_audio

# Use ElevenLabs preset voices
VOICES = {
    "rachel": "EXAVITQu4vr4xnSDxMaL",    # Female, clear
    "adam": "pNInz6obpgDQGcFmaJgB",      # Male, deep
    "bella": "EXAVITQu4vr4xnSDxMaL",     # Female, soft
}

audio = generate_audio(
    text=script,
    voice_id=VOICES["adam"],
    output_path="artifacts/adam_voice.mp3",
)

Configuration Check

Check if ElevenLabs is properly configured:

src/elevenlabs.py

def is_configured() -> bool:
    """Check if ElevenLabs is properly configured."""
    api_key, _ = get_elevenlabs_config()
    return api_key is not None

Usage:

from src.elevenlabs import is_configured

if is_configured():
    print("ElevenLabs ready!")
else:
    print("Add ELEVENLABS_API_KEY to .env")

Output Format

Generated audio files are:

Format: MP3
Quality: 128 kbps (default)
Sample Rate: 44.1 kHz
Channels: Mono
Duration: ~20-40 seconds for winner announcements

Integration with Streamlit UI

The Streamlit app can play generated audio:

app.py

import streamlit as st
from src.elevenlabs import generate_winner_audio, is_configured

if is_configured():
    st.header("🎤 Winner Announcement")
    
    audio_bytes = generate_winner_audio(
        winner_name=winner_name,
        winner_score=winner_score,
        objective=objective,
    )
    
    if audio_bytes:
        st.audio(audio_bytes, format="audio/mp3")
        st.success("Audio generated successfully!")
else:
    st.warning("ElevenLabs not configured. Add ELEVENLABS_API_KEY to .env")

Best Practices

Keep scripts concise

ElevenLabs works best with scripts under 1000 characters:

# Good: Clear and concise
script = "The winner is Agent Gamma with 94.4 points."

# Avoid: Too verbose
script = "After extensive deliberation and comprehensive analysis..."

Use punctuation for pacing

Periods and commas control pacing:

# Good: Natural pauses
script = "The winner is Agent Gamma. Score: 94.4 points."

# Avoid: Run-on sentence
script = "The winner is Agent Gamma score 94.4 points"

Test voice settings

Experiment with stability and similarity:

# More stable (consistent but less expressive)
voice_settings = {"stability": 0.8, "similarity_boost": 0.5}

# More expressive (varied but less consistent)
voice_settings = {"stability": 0.3, "similarity_boost": 0.9}

Handle API failures gracefully

Always check for None returns:

audio = generate_audio(script)
if audio:
    save_to_file(audio)
else:
    log("Skipping audio generation")

Troubleshooting

API Key Invalid

Error: [ElevenLabs] Error: 401 Solution:

Verify API key is correct in .env
Check key hasn’t expired at elevenlabs.io/profile
Ensure key doesn’t start with your_ (placeholder)

Voice ID Not Found

Error: [ElevenLabs] Error: 404 Solution:

Verify voice ID exists in your ElevenLabs account
Use default voice by omitting ELEVENLABS_CLONED_VOICE_ID
Check voice ID format (should be alphanumeric, ~20 characters)

Timeout Errors

Error: Request times out after 30 seconds Solution:

Check network connectivity
Try a shorter script (under 500 characters)
Increase timeout in src/elevenlabs.py:86

Rate Limiting

Error: [ElevenLabs] Error: 429 - Too many requests Solution:

Free tier: 10,000 characters/month
Wait for rate limit reset (check response headers)
Upgrade to paid plan for higher limits

Cost Considerations

Character Usage

ElevenLabs charges by character count:

script = generate_winner_script("Agent Gamma", 94.4, objective)
char_count = len(script)  # ~150 characters
print(f"This will use {char_count} characters")

Pricing Tiers (as of 2026)

Plan	Characters/Month	Cost
Free	10,000	$0
Starter	30,000	$5
Creator	100,000	$22
Pro	500,000	$99

Winner announcements use ~150 characters each. On the free tier, you can generate ~60 announcements per month.

Get Started

Core Concepts

Guides

Integrations

Overview

Why Voice Narration?

Setup and Configuration

How It Works

Script Generation

Audio Generation

Voice Settings

Usage Examples

Basic Usage

Custom Script

Using a Specific Voice

Configuration Check

Output Format

Integration with Streamlit UI

Best Practices

Troubleshooting

API Key Invalid

Voice ID Not Found

Timeout Errors

Rate Limiting

Cost Considerations

Character Usage

Pricing Tiers (as of 2026)

Build docs developers (and LLMs) love

Get Started

Core Concepts

Guides

Integrations

​Overview

​Why Voice Narration?

​Setup and Configuration

​How It Works

​Script Generation

​Audio Generation

​Voice Settings

​Usage Examples

​Basic Usage

​Custom Script

​Using a Specific Voice

​Configuration Check

​Output Format

​Integration with Streamlit UI

​Best Practices

​Troubleshooting

​API Key Invalid

​Voice ID Not Found

​Timeout Errors

​Rate Limiting

​Cost Considerations

​Character Usage

​Pricing Tiers (as of 2026)

​Related

Build docs developers (and LLMs) love

Overview

Why Voice Narration?

Setup and Configuration

How It Works

Script Generation

Audio Generation

Voice Settings

Usage Examples

Basic Usage

Custom Script

Using a Specific Voice

Configuration Check

Output Format

Integration with Streamlit UI

Best Practices

Troubleshooting

API Key Invalid

Voice ID Not Found

Timeout Errors

Rate Limiting

Cost Considerations

Character Usage

Pricing Tiers (as of 2026)

Related