Initialize a VAD context for voice activity detection using the Silero VAD model.

Signature

function initWhisperVad(
  options: VadContextOptions
): Promise<WhisperVadContext>

Parameters

options
VadContextOptions
required
Configuration options for the VAD context
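The option fields can be sketched as a TypeScript interface. This is an assumption inferred only from the examples on this page (the real `VadContextOptions` type in whisper.rn may include additional fields):

```typescript
// Hypothetical shape of VadContextOptions, inferred from the examples below.
interface VadContextOptions {
  // Local file path, or a require()'d bundled asset (which resolves to a
  // numeric module id in React Native).
  filePath: string | number
  // Enable GPU acceleration where supported (assumed optional).
  useGpu?: boolean
  // Number of CPU threads to use (assumed optional).
  nThreads?: number
}

const options: VadContextOptions = {
  filePath: '/path/to/silero_vad.bin',
  useGpu: true,
  nThreads: 4,
}
```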

Returns

Promise<WhisperVadContext>
A promise that resolves to a WhisperVadContext instance.

Example

import { initWhisperVad } from 'whisper.rn'

// Initialize with a local file
const vadContext = await initWhisperVad({
  filePath: '/path/to/silero_vad.bin',
})

console.log('GPU enabled:', vadContext.gpu)

// Initialize with a bundled asset
const assetVadContext = await initWhisperVad({
  filePath: require('./assets/silero_vad.bin'),
})

// Initialize with custom settings
const customVadContext = await initWhisperVad({
  filePath: '/path/to/silero_vad.bin',
  useGpu: true,
  nThreads: 4,
})

// Use the context for speech detection
const segments = await vadContext.detectSpeech('/path/to/audio.wav')

// Release the context when done
await vadContext.release()
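The segments returned by detectSpeech can then be summarized in plain TypeScript. A minimal sketch, assuming each segment exposes start and end timestamps named t0 and t1 in milliseconds (these field names and units are assumptions, not confirmed by this page):

```typescript
// Hypothetical segment shape: start (t0) and end (t1) timestamps in ms.
interface VadSegment {
  t0: number
  t1: number
}

// Sum the duration of all detected speech segments.
function totalSpeechMs(segments: VadSegment[]): number {
  return segments.reduce((sum, s) => sum + (s.t1 - s.t0), 0)
}
```

For example, segments covering 0–500 ms and 1000–1800 ms total 1300 ms of speech.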

Notes

  • The VAD model file (Silero VAD) is separate from the Whisper model
  • Download the Silero VAD model from the whisper.cpp repository or use a pre-converted .bin file
  • GPU acceleration is currently only available on iOS devices with Metal support
  • Always call context.release() when finished to free memory
  • Use releaseAllWhisperVad() to release all VAD contexts at once
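Since release() must run even when detection fails, a try/finally wrapper is a safe pattern. A minimal sketch, using a hypothetical interface standing in for WhisperVadContext (not the library's real type):

```typescript
// Hypothetical stand-in for WhisperVadContext (a sketch, not the real type).
interface ReleasableVadContext {
  detectSpeech(audioPath: string): Promise<{ t0: number; t1: number }[]>
  release(): Promise<void>
}

// Run detection and always release the context, even if detectSpeech throws.
async function detectAndRelease(
  ctx: ReleasableVadContext,
  audioPath: string
): Promise<{ t0: number; t1: number }[]> {
  try {
    return await ctx.detectSpeech(audioPath)
  } finally {
    await ctx.release()
  }
}
```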
