Conversation

Synchronous conversational AI session for real-time voice conversations with an agent.

BETA: This API is subject to change without regard to backwards compatibility.

Constructor

Conversation(
    client: BaseElevenLabs,
    agent_id: str,
    user_id: Optional[str] = None,
    *,
    requires_auth: bool,
    audio_interface: AudioInterface,
    config: Optional[ConversationInitiationData] = None,
    client_tools: Optional[ClientTools] = None,
    callback_agent_response: Optional[Callable[[str], None]] = None,
    callback_agent_response_correction: Optional[Callable[[str, str], None]] = None,
    callback_agent_chat_response_part: Optional[Callable[[str, AgentChatResponsePartType], None]] = None,
    callback_user_transcript: Optional[Callable[[str], None]] = None,
    callback_latency_measurement: Optional[Callable[[int], None]] = None,
    callback_audio_alignment: Optional[Callable[[AudioEventAlignment], None]] = None,
    callback_end_session: Optional[Callable] = None,
    on_prem_config: Optional[OnPremInitiationData] = None,
)

client

BaseElevenLabs

required

The ElevenLabs client to use for the conversation.

agent_id

str

required

The ID of the agent to converse with.

user_id

str

The ID of the user conversing with the agent.

requires_auth

bool

required

Whether the agent requires authentication.

audio_interface

AudioInterface

required

The audio interface to use for input and output.

config

ConversationInitiationData

Configuration options for the conversation including extra_body, conversation_config_override, dynamic_variables, and user_id.

client_tools

ClientTools

Client-side tools that can be called by the agent during the conversation.

callback_agent_response

Callable[[str], None]

Callback function invoked when the agent provides a response.

callback_agent_response_correction

Callable[[str, str], None]

Callback for agent response corrections. First argument is the original response (previously given to callback_agent_response), second argument is the corrected response.

callback_agent_chat_response_part

Callable[[str, AgentChatResponsePartType], None]

Callback for streaming text response chunks. First argument is the text chunk, second argument is the type (START, DELTA, or STOP).

callback_user_transcript

Callable[[str], None]

Callback function invoked when user speech is transcribed.

callback_latency_measurement

Callable[[int], None]

Callback for latency measurements in milliseconds.

callback_audio_alignment

Callable[[AudioEventAlignment], None]

Callback for audio alignment data with character-level timing information.

callback_end_session

Callable

Callback function invoked when the session ends.

on_prem_config

OnPremInitiationData

Configuration options for on-premises deployment.

Methods

start_session

conversation.start_session()

Starts the conversation session. Will run in background thread until end_session is called.

end_session

conversation.end_session()

Ends the conversation session and cleans up resources.

wait_for_session_end

conversation_id = conversation.wait_for_session_end()

Waits for the conversation session to end. You must call end_session before calling this method, otherwise it will block. Returns: The conversation ID, if available.

send_user_message

conversation.send_user_message(text: str)

Send a text message from the user to the agent.

text

str

required

The text message to send to the agent.

Raises: RuntimeError if the session is not active or websocket is not connected.

register_user_activity

conversation.register_user_activity()

Register user activity to prevent session timeout. This sends a ping to the orchestrator to reset the timeout timer. Raises: RuntimeError if the session is not active or websocket is not connected.

send_contextual_update

conversation.send_contextual_update(text: str)

Send a contextual update to the conversation. Contextual updates are non-interrupting content that is sent to the server to update the conversation state without directly prompting the agent.

text

str

required

The contextual information to send to the conversation.

Raises: RuntimeError if the session is not active or websocket is not connected.

Example

from elevenlabs import ElevenLabs
from elevenlabs.conversational_ai import Conversation, DefaultAudioInterface

client = ElevenLabs(api_key="your-api-key")

conversation = Conversation(
    client=client,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=DefaultAudioInterface(),
    callback_agent_response=lambda text: print(f"Agent: {text}"),
    callback_user_transcript=lambda text: print(f"User: {text}"),
)

conversation.start_session()

# Send a text message during the conversation
conversation.send_user_message("Hello, how are you?")

# Wait for user to end the conversation
input("Press Enter to end conversation...")

conversation.end_session()
conversation_id = conversation.wait_for_session_end()
print(f"Conversation ID: {conversation_id}")

AsyncConversation

Asynchronous conversational AI session for real-time voice conversations with an agent.

BETA: This API is subject to change without regard to backwards compatibility.

Constructor

AsyncConversation(
    client: BaseElevenLabs,
    agent_id: str,
    user_id: Optional[str] = None,
    *,
    requires_auth: bool,
    audio_interface: AsyncAudioInterface,
    config: Optional[ConversationInitiationData] = None,
    client_tools: Optional[ClientTools] = None,
    callback_agent_response: Optional[Callable[[str], Awaitable[None]]] = None,
    callback_agent_response_correction: Optional[Callable[[str, str], Awaitable[None]]] = None,
    callback_agent_chat_response_part: Optional[Callable[[str, AgentChatResponsePartType], Awaitable[None]]] = None,
    callback_user_transcript: Optional[Callable[[str], Awaitable[None]]] = None,
    callback_latency_measurement: Optional[Callable[[int], Awaitable[None]]] = None,
    callback_audio_alignment: Optional[Callable[[AudioEventAlignment], Awaitable[None]]] = None,
    callback_end_session: Optional[Callable[[], Awaitable[None]]] = None,
    on_prem_config: Optional[OnPremInitiationData] = None,
)

client

BaseElevenLabs

required

The ElevenLabs client to use for the conversation.

agent_id

str

required

The ID of the agent to converse with.

user_id

str

The ID of the user conversing with the agent.

requires_auth

bool

required

Whether the agent requires authentication.

audio_interface

AsyncAudioInterface

required

The async audio interface to use for input and output.

config

ConversationInitiationData

Configuration options for the conversation.

client_tools

ClientTools

Client-side tools that can be called by the agent during the conversation.

callback_agent_response

Callable[[str], Awaitable[None]]

Async callback function invoked when the agent provides a response.

callback_agent_response_correction

Callable[[str, str], Awaitable[None]]

Async callback for agent response corrections.

callback_agent_chat_response_part

Callable[[str, AgentChatResponsePartType], Awaitable[None]]

Async callback for streaming text response chunks.

callback_user_transcript

Callable[[str], Awaitable[None]]

Async callback function invoked when user speech is transcribed.

callback_latency_measurement

Callable[[int], Awaitable[None]]

Async callback for latency measurements in milliseconds.

callback_audio_alignment

Callable[[AudioEventAlignment], Awaitable[None]]

Async callback for audio alignment data with character-level timing.

callback_end_session

Callable[[], Awaitable[None]]

Async callback function invoked when the session ends.

on_prem_config

OnPremInitiationData

Configuration options for on-premises deployment.

Methods

All methods are async and should be awaited.

start_session

await conversation.start_session()

Starts the conversation session. Will run in background task until end_session is called.

end_session

await conversation.end_session()

Ends the conversation session and cleans up resources.

wait_for_session_end

conversation_id = await conversation.wait_for_session_end()

Waits for the conversation session to end. You must call end_session before calling this method, otherwise it will block. Returns: The conversation ID, if available.

send_user_message

await conversation.send_user_message(text: str)

Send a text message from the user to the agent.

text

str

required

The text message to send to the agent.

Raises: RuntimeError if the session is not active or websocket is not connected.

register_user_activity

await conversation.register_user_activity()

Register user activity to prevent session timeout. Raises: RuntimeError if the session is not active or websocket is not connected.

send_contextual_update

await conversation.send_contextual_update(text: str)

Send a contextual update to the conversation.

text

str

required

The contextual information to send to the conversation.

Raises: RuntimeError if the session is not active or websocket is not connected.

Example

import asyncio
from elevenlabs import AsyncElevenLabs
from elevenlabs.conversational_ai import AsyncConversation, AsyncDefaultAudioInterface

async def main():
    client = AsyncElevenLabs(api_key="your-api-key")
    
    conversation = AsyncConversation(
        client=client,
        agent_id="your-agent-id",
        requires_auth=True,
        audio_interface=AsyncDefaultAudioInterface(),
        callback_agent_response=lambda text: print(f"Agent: {text}"),
        callback_user_transcript=lambda text: print(f"User: {text}"),
    )
    
    await conversation.start_session()
    
    # Send a text message during the conversation
    await conversation.send_user_message("Hello, how are you?")
    
    # Simulate conversation time
    await asyncio.sleep(30)
    
    await conversation.end_session()
    conversation_id = await conversation.wait_for_session_end()
    print(f"Conversation ID: {conversation_id}")

asyncio.run(main())

Supporting Classes

ConversationInitiationData

Configuration options for the Conversation.

ConversationInitiationData(
    extra_body: Optional[dict] = None,
    conversation_config_override: Optional[dict] = None,
    dynamic_variables: Optional[dict] = None,
    user_id: Optional[str] = None,
)

extra_body

dict

Additional custom data to include in the conversation initiation.

conversation_config_override

dict

Configuration overrides for the conversation.

dynamic_variables

dict

Dynamic variables to use during the conversation.

user_id

str

The ID of the user conversing with the agent.

AudioEventAlignment

Audio alignment data containing character-level timing information.

@dataclass
class AudioEventAlignment:
    chars: List[str]
    char_start_times_ms: List[int]
    char_durations_ms: List[int]

chars

List[str]

List of characters in the audio.

char_start_times_ms

List[int]

Start times for each character in milliseconds.

char_durations_ms

List[int]

Duration of each character in milliseconds.

AgentChatResponsePartType

Enum for streaming text response types.

class AgentChatResponsePartType(str, Enum):
    START = "start"  # Beginning of a response
    DELTA = "delta"  # Text chunk in the middle of a response
    STOP = "stop"    # End of a response

Client

Text to Speech

Voices

Conversational AI

Audio Processing

History & Models

Account & Usage

Conversation

Conversation

Constructor

Methods

start_session

end_session

wait_for_session_end

send_user_message

register_user_activity

send_contextual_update

Example

AsyncConversation

Constructor

Methods

start_session

end_session

wait_for_session_end

send_user_message

register_user_activity

send_contextual_update

Example

Supporting Classes

ConversationInitiationData

AudioEventAlignment

AgentChatResponsePartType

Build docs developers (and LLMs) love

Client

Text to Speech

Voices

Conversational AI

Audio Processing

History & Models

Account & Usage

​Conversation

​Constructor

​Methods

​start_session

​end_session

​wait_for_session_end

​send_user_message

​register_user_activity

​send_contextual_update

​Example

​AsyncConversation

​Constructor

​Methods

​start_session

​end_session

​wait_for_session_end

​send_user_message

​register_user_activity

​send_contextual_update

​Example

​Supporting Classes

​ConversationInitiationData

​AudioEventAlignment

​AgentChatResponsePartType

Build docs developers (and LLMs) love

Conversation

Constructor

Methods

start_session

end_session

wait_for_session_end

send_user_message

register_user_activity

send_contextual_update

Example

AsyncConversation

Constructor

Methods

start_session

end_session

wait_for_session_end

send_user_message

register_user_activity

send_contextual_update

Example

Supporting Classes

ConversationInitiationData

AudioEventAlignment

AgentChatResponsePartType