Microphone Input

The Microphone class provides access to your computer’s audio input devices for real-time speech recognition. This guide covers device selection, configuration, and best practices for working with microphone input.

Basic Usage

The simplest way to capture audio from your default microphone:

import speech_recognition as sr

# Create recognizer instance
r = sr.Recognizer()

# Capture audio from default microphone
with sr.Microphone() as source:
    print("Say something!")
    audio = r.listen(source)

# Recognize the audio
try:
    text = r.recognize_google(audio)
    print(f"You said: {text}")
except sr.UnknownValueError:
    print("Could not understand audio")
except sr.RequestError as e:
    print(f"Error: {e}")

Device Selection

Listing Available Microphones

Use list_microphone_names() to discover all available audio input devices:

import speech_recognition as sr

# List all microphone names
for index, name in enumerate(sr.Microphone.list_microphone_names()):
    print(f"Microphone {index}: {name}")

Example output:

Microphone 0: Built-in Microphone
Microphone 1: USB Audio Device
Microphone 2: Blue Yeti

Using a Specific Device

Select a microphone by its index:

import speech_recognition as sr

r = sr.Recognizer()

# Use microphone at index 2
with sr.Microphone(device_index=2) as source:
    print("Listening on Blue Yeti...")
    audio = r.listen(source)

Device indices range from 0 to pyaudio.get_device_count() - 1. If you specify an invalid index, you’ll get an assertion error with the valid range.

Finding Working Microphones

The list_working_microphones() method identifies microphones that are actively receiving audio:

import speech_recognition as sr

print("Make some noise into your microphone...")
working_mics = sr.Microphone.list_working_microphones()

for index, name in working_mics.items():
    print(f"Working microphone {index}: {name}")

# Use the first working microphone
if working_mics:
    first_working_index = list(working_mics.keys())[0]
    with sr.Microphone(device_index=first_working_index) as source:
        print(f"Using {working_mics[first_working_index]}")
        audio = r.listen(source)

When using list_working_microphones(), ensure your microphone is unmuted and make some noise to help the method detect active devices.

Microphone Configuration

Sample Rate

The sample rate controls audio quality and processing speed:

import speech_recognition as sr

# Use custom sample rate (default is device's default rate)
with sr.Microphone(sample_rate=16000) as source:
    # Lower sample rates reduce bandwidth but may affect quality
    audio = r.listen(source)

# Let the library auto-detect the optimal rate (recommended)
with sr.Microphone() as source:
    # Uses device's default sample rate
    audio = r.listen(source)

Higher sample rates (e.g., 44100 Hz) provide better audio quality but require more processing power. Some devices like Raspberry Pi may struggle with high sample rates. The default auto-detection usually provides the best balance.

Chunk Size

Chunk size affects noise sensitivity and detection responsiveness:

import speech_recognition as sr

# Default chunk size is 1024 samples
with sr.Microphone(chunk_size=1024) as source:
    audio = r.listen(source)

# Larger chunk size reduces sensitivity to rapid noise changes
with sr.Microphone(chunk_size=2048) as source:
    # Less sensitive but more stable
    audio = r.listen(source)

The default chunk size of 1024 works well for most applications. Only adjust if you’re experiencing issues with ambient noise triggering false positives.

Real-Time Recognition

Import the library

Import SpeechRecognition and create a recognizer:

import speech_recognition as sr

r = sr.Recognizer()

Configure the microphone

Open the microphone as a context manager:

with sr.Microphone() as source:
    # Microphone is active here
    pass

Adjust for ambient noise (optional)

Calibrate the energy threshold for your environment:

with sr.Microphone() as source:
    print("Calibrating for ambient noise... Please wait.")
    r.adjust_for_ambient_noise(source, duration=1)
    print("Ready!")

Listen for speech

Capture audio until a pause is detected:

with sr.Microphone() as source:
    r.adjust_for_ambient_noise(source)
    print("Say something!")
    audio = r.listen(source)

Process the audio

Send the audio to a recognition engine:

try:
    text = r.recognize_google(audio)
    print(f"You said: {text}")
except sr.UnknownValueError:
    print("Could not understand audio")
except sr.RequestError as e:
    print(f"API error: {e}")

Complete Example

Here’s a complete microphone recognition example from the library:

#!/usr/bin/env python3
import speech_recognition as sr

# Create recognizer
r = sr.Recognizer()

# Use default microphone as audio source
with sr.Microphone() as source:
    print("Say something!")
    audio = r.listen(source)

# Try multiple recognition engines
try:
    print("Google Speech Recognition: " + r.recognize_google(audio))
except sr.UnknownValueError:
    print("Google could not understand audio")
except sr.RequestError as e:
    print(f"Google error: {e}")

try:
    print("Whisper: " + r.recognize_whisper(audio))
except sr.UnknownValueError:
    print("Whisper could not understand audio")
except sr.RequestError as e:
    print(f"Whisper error: {e}")

Troubleshooting

PyAudio installation errors

The Microphone class requires PyAudio. If you get an AttributeError: Could not find PyAudio, install it:

# On Ubuntu/Debian
sudo apt-get install portaudio19-dev python3-pyaudio
pip install pyaudio

# On macOS
brew install portaudio
pip install pyaudio

# On Windows
pip install pyaudio

No audio detected

If listen() never returns or times out:

Check that your microphone is not muted
Verify the correct device index is selected
Test with list_working_microphones() while making noise
Adjust the energy threshold:

r = sr.Recognizer()
r.energy_threshold = 4000  # Increase if too sensitive
r.dynamic_energy_threshold = False  # Disable auto-adjustment

Device index out of range

If you get a device index error:

# Check how many devices are available
import speech_recognition as sr
mic_list = sr.Microphone.list_microphone_names()
print(f"Found {len(mic_list)} microphones")
print(f"Valid indices: 0 to {len(mic_list) - 1}")

Audio cutting off too early

If speech is being cut off prematurely:

r = sr.Recognizer()
r.pause_threshold = 1.0  # Increase from default 0.8 seconds
r.phrase_threshold = 0.2  # Reduce minimum phrase length

Poor recognition quality

If recognition accuracy is low:

Use adjust_for_ambient_noise() before each listening session
Position the microphone closer to the speaker
Use a higher-quality microphone
Increase the sample rate if your system can handle it:

with sr.Microphone(sample_rate=44100) as source:
    audio = r.listen(source)

Continuous listening in the background

For applications that need to listen continuously, see the Background Listening guide.

API Reference

Microphone Class

sr.Microphone(
    device_index=None,  # None for default, or integer index
    sample_rate=None,   # None for auto-detect, or integer Hz
    chunk_size=1024     # Number of frames per buffer
)

Class Methods:

list_microphone_names() - Returns list of all microphone names
list_working_microphones() - Returns dict mapping indices to names of active microphones

Context Manager: The Microphone class must be used as a context manager (with with statement) to properly open and close the audio stream. Attributes (after entering context):

SAMPLE_RATE - Sample rate in Hz
SAMPLE_WIDTH - Sample width in bytes
CHUNK - Chunk size in frames

Getting Started

Core Concepts

Recognition Engines

Guides

Examples

Basic Usage

Device Selection

Listing Available Microphones

Using a Specific Device

Finding Working Microphones

Microphone Configuration

Sample Rate

Chunk Size

Real-Time Recognition

Complete Example

Troubleshooting

API Reference

Microphone Class

See Also

Getting Started

Core Concepts

Recognition Engines

Guides

Examples

​Basic Usage

​Device Selection

​Listing Available Microphones

​Using a Specific Device

​Finding Working Microphones

​Microphone Configuration

​Sample Rate

​Chunk Size

​Real-Time Recognition

​Complete Example

​Troubleshooting

​API Reference

​Microphone Class

​See Also

Basic Usage

Device Selection

Listing Available Microphones

Using a Specific Device

Finding Working Microphones

Microphone Configuration

Sample Rate

Chunk Size

Real-Time Recognition

Complete Example

Troubleshooting

API Reference

Microphone Class

See Also