Conversations

Overview

The Conversation class manages WebSocket-based real-time conversations with your AI agents. It handles audio streaming, message events, and the conversation lifecycle.

Basic Usage

Create and start a conversation:

from elevenlabs.client import ElevenLabs
from elevenlabs.conversational_ai.conversation import Conversation
from elevenlabs.conversational_ai.default_audio_interface import DefaultAudioInterface

elevenlabs = ElevenLabs(api_key="YOUR_API_KEY")

# Create audio interface
audio_interface = DefaultAudioInterface()

# Create conversation
conversation = Conversation(
    client=elevenlabs,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=audio_interface,
)

# Start conversation in background
conversation.start_session()

# ... conversation runs in background thread ...

# End conversation
conversation.end_session()

# Wait for cleanup
conversation_id = conversation.wait_for_session_end()
print(f"Conversation ID: {conversation_id}")

Constructor Parameters

client

ElevenLabs

required

The ElevenLabs client instance

agent_id

str

required

The ID of the agent to converse with

requires_auth

bool

required

Whether the agent requires authentication

audio_interface

AudioInterface

required

Audio interface for input/output handling

user_id

str

Optional user identifier for the conversation

config

ConversationInitiationData

Configuration options for the conversation

client_tools

ClientTools

Custom tools the agent can call during conversation

Event Callbacks

def on_agent_response(response: str):
    print(f"Agent said: {response}")

def on_user_transcript(transcript: str):
    print(f"User said: {transcript}")

def on_latency(latency_ms: int):
    print(f"Latency: {latency_ms}ms")

conversation = Conversation(
    client=elevenlabs,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=audio_interface,
    callback_agent_response=on_agent_response,
    callback_user_transcript=on_user_transcript,
    callback_latency_measurement=on_latency,
)

Available Callbacks

callback_agent_response

Callable[[str], None]

Called when the agent produces a complete response

callback_agent_response_correction

Callable[[str, str], None]

Called when the agent corrects a previous response. First arg is original, second is corrected.

callback_agent_chat_response_part

Callable[[str, AgentChatResponsePartType], None]

Called for streaming text response chunks. Part type is START, DELTA, or STOP.

callback_user_transcript

Callable[[str], None]

Called when user speech is transcribed

callback_latency_measurement

Callable[[int], None]

Called with latency measurements in milliseconds

callback_audio_alignment

Callable[[AudioEventAlignment], None]

Called with character-level audio alignment data

callback_end_session

Callable[[], None]

Called when the conversation session ends

Streaming Response Parts

Handle streaming text responses from the agent:

from elevenlabs.conversational_ai.conversation import AgentChatResponsePartType

def on_chat_part(text: str, part_type: AgentChatResponsePartType):
    if part_type == AgentChatResponsePartType.START:
        print("Agent starting response...")
    elif part_type == AgentChatResponsePartType.DELTA:
        print(text, end="", flush=True)
    elif part_type == AgentChatResponsePartType.STOP:
        print("\nAgent finished response.")

conversation = Conversation(
    client=elevenlabs,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=audio_interface,
    callback_agent_chat_response_part=on_chat_part,
)

Audio Alignment

Get character-level timing information for agent audio:

from elevenlabs.conversational_ai.conversation import AudioEventAlignment

def on_alignment(alignment: AudioEventAlignment):
    for i, char in enumerate(alignment.chars):
        start_ms = alignment.char_start_times_ms[i]
        duration_ms = alignment.char_durations_ms[i]
        print(f"{char}: {start_ms}ms (+{duration_ms}ms)")

conversation = Conversation(
    client=elevenlabs,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=audio_interface,
    callback_audio_alignment=on_alignment,
)

Sending Messages

Send text messages to the agent programmatically:

conversation.start_session()

# Send a text message from the user
conversation.send_user_message("What is the weather like today?")

# Send contextual updates (non-interrupting)
conversation.send_contextual_update("User is looking at the weather page")

# Register user activity to prevent timeout
conversation.register_user_activity()

Message Methods

send_user_message

method

Send a text message from the user to the agent

conversation.send_user_message(text: str)

send_contextual_update

method

Send non-interrupting contextual information to update conversation state

conversation.send_contextual_update(text: str)

register_user_activity

method

Send a ping to prevent session timeout

conversation.register_user_activity()

Configuration Options

Customize conversation behavior with ConversationInitiationData:

from elevenlabs.conversational_ai.conversation import ConversationInitiationData

config = ConversationInitiationData(
    extra_body={"custom_param": "value"},
    conversation_config_override={
        "language": "en",
        "max_duration_seconds": 300,
    },
    dynamic_variables={
        "user_name": "John",
        "account_type": "premium",
    },
    user_id="user_12345",
)

conversation = Conversation(
    client=elevenlabs,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=audio_interface,
    config=config,
)

Configuration Fields

extra_body

dict

Additional custom parameters passed to the LLM

conversation_config_override

dict

Override default conversation configuration settings

dynamic_variables

dict

Dynamic variables accessible to the agent during conversation

user_id

str

Identifier for the user in this conversation

Session Management

Start Session

Starts the conversation in a background thread:

conversation.start_session()
# Returns immediately, conversation runs in background

End Session

Ends the conversation and cleans up resources:

conversation.end_session()

Wait for Session End

Blocks until the conversation completes:

conversation.end_session()
conversation_id = conversation.wait_for_session_end()
print(f"Conversation {conversation_id} has ended")

Call end_session() before wait_for_session_end(), otherwise it will block indefinitely.

Async Conversations

Use AsyncConversation for async/await workflows:

import asyncio
from elevenlabs.client import AsyncElevenLabs
from elevenlabs.conversational_ai.conversation import AsyncConversation
from elevenlabs.conversational_ai.default_audio_interface import AsyncDefaultAudioInterface

elevenlabs = AsyncElevenLabs(api_key="YOUR_API_KEY")

async def on_agent_response(response: str):
    print(f"Agent: {response}")

async def on_user_transcript(transcript: str):
    print(f"User: {transcript}")

async def main():
    audio_interface = AsyncDefaultAudioInterface()
    
    conversation = AsyncConversation(
        client=elevenlabs,
        agent_id="your-agent-id",
        requires_auth=True,
        audio_interface=audio_interface,
        callback_agent_response=on_agent_response,
        callback_user_transcript=on_user_transcript,
    )
    
    await conversation.start_session()
    
    # Send a message
    await conversation.send_user_message("Hello!")
    
    # Wait a bit
    await asyncio.sleep(10)
    
    await conversation.end_session()
    conversation_id = await conversation.wait_for_session_end()
    print(f"Conversation {conversation_id} ended")

asyncio.run(main())

All async callbacks must be async functions. Use AsyncAudioInterface instead of AudioInterface.

Error Handling

Handle connection and runtime errors:

import logging

logging.basicConfig(level=logging.INFO)

try:
    conversation.start_session()
    conversation.wait_for_session_end()
except RuntimeError as e:
    print(f"Conversation error: {e}")
except Exception as e:
    print(f"Unexpected error: {e}")
finally:
    conversation.end_session()

Complete Example

from elevenlabs.client import ElevenLabs
from elevenlabs.conversational_ai.conversation import (
    Conversation,
    ConversationInitiationData,
    ClientTools,
)
from elevenlabs.conversational_ai.default_audio_interface import DefaultAudioInterface

elevenlabs = ElevenLabs(api_key="YOUR_API_KEY")

# Set up callbacks
def on_agent_response(response: str):
    print(f"Agent: {response}")

def on_user_transcript(transcript: str):
    print(f"User: {transcript}")

def on_latency(ms: int):
    print(f"Latency: {ms}ms")

# Set up tools
client_tools = ClientTools()

def get_weather(params):
    location = params.get("location", "Unknown")
    return f"Weather in {location}: Sunny, 72°F"

client_tools.register("get_weather", get_weather, is_async=False)

# Configure conversation
config = ConversationInitiationData(
    dynamic_variables={"user_name": "Alice"},
)

# Create and start conversation
audio_interface = DefaultAudioInterface()

conversation = Conversation(
    client=elevenlabs,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=audio_interface,
    config=config,
    client_tools=client_tools,
    callback_agent_response=on_agent_response,
    callback_user_transcript=on_user_transcript,
    callback_latency_measurement=on_latency,
)

conversation.start_session()

# Conversation runs until user ends it
input("Press Enter to end conversation...")

conversation.end_session()
conversation_id = conversation.wait_for_session_end()
print(f"Conversation {conversation_id} ended")

Getting Started

Core Features

Conversational AI

Advanced Features

Guides

Overview

Basic Usage

Constructor Parameters

Event Callbacks

Available Callbacks

Streaming Response Parts

Audio Alignment

Sending Messages

Message Methods

Configuration Options

Configuration Fields

Session Management

Start Session

End Session

Wait for Session End

Async Conversations

Error Handling

Complete Example

Build docs developers (and LLMs) love

Getting Started

Core Features

Conversational AI

Advanced Features

Guides

​Overview

​Basic Usage

​Constructor Parameters

​Event Callbacks

​Available Callbacks

​Streaming Response Parts

​Audio Alignment

​Sending Messages

​Message Methods

​Configuration Options

​Configuration Fields

​Session Management

​Start Session

​End Session

​Wait for Session End

​Async Conversations

​Error Handling

​Complete Example

Build docs developers (and LLMs) love

Overview

Basic Usage

Constructor Parameters

Event Callbacks

Available Callbacks

Streaming Response Parts

Audio Alignment

Sending Messages

Message Methods

Configuration Options

Configuration Fields

Session Management

Start Session

End Session

Wait for Session End

Async Conversations

Error Handling

Complete Example