recognize_houndify()

Overview

Performs speech recognition using the Houndify API. Houndify provides fast, accurate speech recognition with support for custom domains and commands.

Method Signature

recognize_houndify(
    audio_data: AudioData,
    client_id: str,
    client_key: str,
    show_all: bool = False
) -> str | tuple[str, float] | dict

Parameters

audio_data

AudioData

required

The audio data to recognize. Must be an AudioData instance.

client_id

str

required

Houndify client ID (Base64-encoded string).See setup instructions below for how to obtain credentials.

client_key

str

required

Houndify client key (Base64-encoded string).

show_all

bool

default:"False"

If True, returns the raw API response as a JSON dictionary. If False, returns a tuple of (transcript, confidence).

Returns

result

tuple[str, float]

When show_all=False, returns (transcript, confidence) where:

transcript: The recognized text
confidence: Confidence score between 0 and 1

response

dict

When show_all=True, returns the raw API response containing:

Disambiguation: Recognition results
ChoiceData: List of alternatives with transcriptions and confidence scores

Exceptions

UnknownValueError

Exception

Raised when the speech is unintelligible

RequestError

Exception

Raised when:

The API request fails
The client ID or key is invalid
There is no internet connection

Example Usage

Basic Recognition

import speech_recognition as sr

# Initialize recognizer
r = sr.Recognizer()

# Your Houndify credentials
HOUNDIFY_CLIENT_ID = "your-client-id"
HOUNDIFY_CLIENT_KEY = "your-client-key"

# Record audio
with sr.Microphone() as source:
    print("Say something!")
    audio = r.listen(source)

# Recognize with Houndify
try:
    text, confidence = r.recognize_houndify(
        audio,
        client_id=HOUNDIFY_CLIENT_ID,
        client_key=HOUNDIFY_CLIENT_KEY
    )
    print(f"You said: {text}")
    print(f"Confidence: {confidence:.2%}")
except sr.UnknownValueError:
    print("Could not understand audio")
except sr.RequestError as e:
    print(f"API error: {e}")

Getting Full API Response

import speech_recognition as sr
import json

HOUNDIFY_CLIENT_ID = "your-client-id"
HOUNDIFY_CLIENT_KEY = "your-client-key"

r = sr.Recognizer()

with sr.Microphone() as source:
    audio = r.listen(source)

try:
    # Get complete response
    response = r.recognize_houndify(
        audio,
        client_id=HOUNDIFY_CLIENT_ID,
        client_key=HOUNDIFY_CLIENT_KEY,
        show_all=True
    )
    
    print(json.dumps(response, indent=2))
    
    # Access alternatives
    if 'Disambiguation' in response and 'ChoiceData' in response['Disambiguation']:
        for i, choice in enumerate(response['Disambiguation']['ChoiceData']):
            print(f"Alternative {i+1}:")
            print(f"  Text: {choice['Transcription']}")
            print(f"  Confidence: {choice['ConfidenceScore']:.2%}")
except sr.UnknownValueError:
    print("Could not understand audio")

From Audio File

import speech_recognition as sr

HOUNDIFY_CLIENT_ID = "your-client-id"
HOUNDIFY_CLIENT_KEY = "your-client-key"

r = sr.Recognizer()

# Load audio file
with sr.AudioFile("speech.wav") as source:
    audio = r.record(source)

try:
    text, confidence = r.recognize_houndify(
        audio,
        client_id=HOUNDIFY_CLIENT_ID,
        client_key=HOUNDIFY_CLIENT_KEY
    )
    print(f"Transcript: {text}")
    print(f"Confidence: {confidence:.2%}")
except sr.RequestError as e:
    print(f"Error: {e}")

Using Environment Variables

import speech_recognition as sr
import os

# Store credentials in environment variables
HOUNDIFY_CLIENT_ID = os.getenv("HOUNDIFY_CLIENT_ID")
HOUNDIFY_CLIENT_KEY = os.getenv("HOUNDIFY_CLIENT_KEY")

r = sr.Recognizer()

with sr.Microphone() as source:
    audio = r.listen(source)

try:
    text, confidence = r.recognize_houndify(
        audio,
        client_id=HOUNDIFY_CLIENT_ID,
        client_key=HOUNDIFY_CLIENT_KEY
    )
    print(f"Transcript: {text}")
except sr.RequestError as e:
    print(f"Error: {e}")

Setup Instructions

1. Create Houndify Account

Go to Houndify
Sign up for a free account
Log in to the Houndify Dashboard

2. Register a New Client

In the dashboard, click Register a new client
Fill in the form:
- Client Name: Give your client a name
- Platform: Select appropriate platform
Continue to Enable Domains page

3. Enable Speech to Text Domain

On the Enable Domains page, find Speech To Text Only
Toggle it to ON
Click Save & Continue

4. Get Client ID and Key

Go to the Houndify Dashboard
Find your client and click View Details
On the client details page, you’ll see:
- Client ID: Base64-encoded string
- Client Key: Base64-encoded string
Copy both values

5. Use in Code

import speech_recognition as sr

HOUNDIFY_CLIENT_ID = "your-base64-client-id"
HOUNDIFY_CLIENT_KEY = "your-base64-client-key"

r = sr.Recognizer()

with sr.Microphone() as source:
    audio = r.listen(source)

text, confidence = r.recognize_houndify(
    audio,
    client_id=HOUNDIFY_CLIENT_ID,
    client_key=HOUNDIFY_CLIENT_KEY
)
print(text)

Language Support

Currently, only English is supported for speech recognition through the “Speech To Text Only” domain. For multilingual support, you would need to use Houndify’s full voice assistant capabilities with custom domains.

Features

Fast Recognition: Houndify is known for low-latency responses
High Accuracy: Good accuracy for conversational speech
Confidence Scores: Returns confidence with each transcription
Free Tier: Includes free tier for development and testing

Notes

Requires internet connection
Audio must be 8 kHz or 16 kHz sample rate
Audio is automatically converted to 16-bit samples
Returns both transcript and confidence score
Client ID and key are Base64-encoded strings
Free tier available with usage limits
Primarily designed for English language

Core Classes

Recognition Methods

Exceptions

Overview

Method Signature

Parameters

Returns

Exceptions

Example Usage

Basic Recognition

Getting Full API Response

From Audio File

Using Environment Variables

Setup Instructions

1. Create Houndify Account

2. Register a New Client

3. Enable Speech to Text Domain

4. Get Client ID and Key

5. Use in Code

Language Support

Features

Notes

Core Classes

Recognition Methods

Exceptions

​Overview

​Method Signature

​Parameters

​Returns

​Exceptions

​Example Usage

​Basic Recognition

​Getting Full API Response

​From Audio File

​Using Environment Variables

​Setup Instructions

​1. Create Houndify Account

​2. Register a New Client

​3. Enable Speech to Text Domain

​4. Get Client ID and Key

​5. Use in Code

​Language Support

​Features

​Notes

Overview

Method Signature

Parameters

Returns

Exceptions

Example Usage

Basic Recognition

Getting Full API Response

From Audio File

Using Environment Variables

Setup Instructions

1. Create Houndify Account

2. Register a New Client

3. Enable Speech to Text Domain

4. Get Client ID and Key

5. Use in Code

Language Support

Features

Notes