Endpoint

GET /ws
Establishes a WebSocket connection for real-time audio streaming and language detection.

Connection Flow

  1. Client initiates WebSocket connection to /ws
  2. Server accepts connection and assigns a unique connection_id
  3. Client sends audio data as binary chunks
  4. Server buffers audio until minimum size is reached (20,000 bytes)
  5. Server processes audio and returns language detection result
  6. Connection closes after sending response
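
Steps 3 and 4 of the flow above can be sketched as plain functions: the client streams arbitrary-sized binary chunks, and the server accumulates them until the 20,000-byte minimum is reached. The 4,096-byte chunk size below is an illustrative choice, not a server requirement, and `buffer_until_ready` is a hypothetical helper, not the service's actual implementation:

```python
from typing import Iterable, Iterator, Optional

MIN_BUFFER_SIZE = 20_000  # bytes the server waits for before processing


def iter_chunks(data: bytes, size: int = 4096) -> Iterator[bytes]:
    """Yield successive binary chunks, as a client would send them."""
    for offset in range(0, len(data), size):
        yield data[offset:offset + size]


def buffer_until_ready(chunks: Iterable[bytes]) -> Optional[bytes]:
    """Accumulate chunks; return the buffer once the minimum is reached."""
    buffer = bytearray()
    for chunk in chunks:
        if not chunk:  # empty chunks are ignored (see Error Handling)
            continue
        buffer.extend(chunk)
        if len(buffer) >= MIN_BUFFER_SIZE:
            return bytes(buffer)
    return None  # stream ended before enough audio arrived
```

If the client disconnects before the minimum is reached, no detection runs, which is why short clips produce no response.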

Message Protocol

Client to Server

Send raw audio data as binary WebSocket frames:
audio_data (bytes, required)
Audio data in MP4 format. The server buffers chunks until at least 20,000 bytes are received (approximately 1 second of audio).

Audio Requirements

  • Format: MP4 container with audio codec
  • Minimum size: 20,000 bytes
  • Recommended length: 4-15 seconds
  • Bitrate: 16,000 bits per second
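
Of the requirements above, only the minimum byte size can be checked cheaply before opening a socket; verifying the container format and duration would need a real MP4 parser. `check_audio_file` below is an illustrative client-side helper, not part of the API:

```python
import os

MIN_BYTES = 20_000  # server-side minimum buffer size


def check_audio_file(path: str) -> int:
    """Raise ValueError if the file is too small to trigger detection.

    Returns the file size in bytes when the check passes.
    """
    size = os.path.getsize(path)
    if size < MIN_BYTES:
        raise ValueError(
            f"{path} is {size} bytes; the server needs at least {MIN_BYTES}"
        )
    return size
```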

Server to Client

The server sends JSON responses in two scenarios:

Success Response

{
  "status": "success",
  "data": {
    "language": "en",
    "confidence": 0.9,
    "processing_time": 2.34,
    "connection_id": "a1b2c3d4"
  },
  "timestamp": "2026-03-08T10:30:45.123456",
  "connection_id": "a1b2c3d4"
}
status (string, required)
Always "success" for successful detection.

data (object, required)
Language detection results.

timestamp (string, required)
ISO 8601 timestamp when the response was generated.

connection_id (string, required)
8-character unique identifier for this connection.
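
Because both response shapes share the `status` field, a client can dispatch on it with a few lines. `handle_response` is an illustrative helper built from the documented payloads above:

```python
import json


def handle_response(raw: str) -> dict:
    """Parse a server JSON frame.

    Returns the `data` payload on success; raises RuntimeError carrying
    the server's message on an error response.
    """
    response = json.loads(raw)
    if response["status"] == "success":
        return response["data"]
    raise RuntimeError(response.get("message", "unknown error"))
```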

Error Response

{
  "status": "error",
  "message": "Error description",
  "timestamp": "2026-03-08T10:30:45.123456",
  "connection_id": "a1b2c3d4"
}
status (string, required)
Always "error" for failed detection.

message (string, required)
Human-readable error description.

timestamp (string, required)
ISO 8601 timestamp when the error occurred.

connection_id (string, required)
8-character unique identifier for this connection.

Implementation Examples

JavaScript

const ws = new WebSocket('wss://languagedetection-9l89.onrender.com/ws');

ws.onopen = () => {
  console.log('WebSocket connected');
  
  // Send audio buffer
  fetch('audio.mp4')
    .then(res => res.arrayBuffer())
    .then(buffer => ws.send(buffer));
};

ws.onmessage = (event) => {
  const response = JSON.parse(event.data);
  
  if (response.status === 'success') {
    console.log('Language:', response.data.language);
    console.log('Processing time:', response.data.processing_time, 's');
  } else {
    console.error('Error:', response.message);
  }
};

ws.onerror = (error) => {
  console.error('WebSocket error:', error);
};

ws.onclose = () => {
  console.log('WebSocket disconnected');
};

Python

import asyncio
import websockets
import json

async def detect_language(audio_file_path):
    uri = "wss://languagedetection-9l89.onrender.com/ws"
    
    async with websockets.connect(uri) as websocket:
        # Read and send audio file
        with open(audio_file_path, 'rb') as f:
            audio_data = f.read()
            await websocket.send(audio_data)
        
        # Receive response
        response = await websocket.recv()
        result = json.loads(response)
        
        if result['status'] == 'success':
            print(f"Language: {result['data']['language']}")
            print(f"Processing time: {result['data']['processing_time']}s")
        else:
            print(f"Error: {result['message']}")

asyncio.run(detect_language('audio.mp4'))

Connection Tracking

Each WebSocket connection receives a unique 8-character connection_id (shortened UUID) used for:
  • Request tracing in server logs
  • Correlating responses with requests
  • Debugging and monitoring
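
One plausible way to derive such an ID is to truncate a UUID4 to its first 8 hex characters; the service's actual derivation may differ:

```python
import uuid


def make_connection_id() -> str:
    """Shorten a random UUID to an 8-character hex identifier.

    Illustrative sketch only; collisions are possible but rare enough
    for log correlation within a connection's lifetime.
    """
    return uuid.uuid4().hex[:8]
```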

Metrics Impact

Each connection affects the following server metrics:
  • active_connections: Incremented on connection, decremented on close
  • total_requests: Incremented when connection is established
  • errors: Incremented if processing fails
  • processing_times: Audio processing duration added to rolling buffer
See metrics endpoint for details.
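
The counter updates listed above can be sketched as a small in-memory structure. This is a hypothetical mirror of the behavior described, not the service's actual implementation; the rolling-buffer length of 100 is an assumption:

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Metrics:
    """In-memory counters mirroring the metrics listed above."""
    active_connections: int = 0
    total_requests: int = 0
    errors: int = 0
    # rolling buffer of recent processing durations (length is assumed)
    processing_times: deque = field(default_factory=lambda: deque(maxlen=100))

    def on_connect(self) -> None:
        self.active_connections += 1
        self.total_requests += 1

    def on_close(self) -> None:
        self.active_connections -= 1

    def on_error(self) -> None:
        self.errors += 1

    def on_processed(self, seconds: float) -> None:
        self.processing_times.append(seconds)
```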

Connection Lifecycle

From websocket_manager.py:18-92:
  1. Accept: Connection accepted, connection_id assigned
  2. Increment: active_connections and total_requests counters updated
  3. Buffer: Audio chunks buffered until minimum size reached
  4. Process: Audio sent to OpenAI API for transcription
  5. Respond: Language detection result sent as JSON
  6. Close: WebSocket closed, active_connections decremented

Error Handling

The endpoint handles these error scenarios:
  • Empty data chunks (ignored)
  • OpenAI API errors (returned as error response)
  • WebSocket disconnection (logged, metrics updated)
  • General exceptions (logged, error response sent, connection closed)
