SliceManager

Overview

SliceManager handles automatic slicing of continuous audio streams into manageable chunks for transcription. It manages memory efficiently by limiting the number of slices kept in memory and automatically releasing old slices. Key Features:

Automatic audio slicing based on duration
Circular buffer strategy for memory management
Configurable slice duration and memory limits
Audio data retrieval for transcription
Memory usage tracking and statistics

Constructor

new SliceManager(
  sliceDurationSec?: number,
  maxSlicesInMemory?: number,
  sampleRate?: number
)

Parameters

sliceDurationSec

number

default:"30"

Duration of each audio slice in seconds

maxSlicesInMemory

number

default:"1"

Maximum number of audio slices to keep in memory. Older slices are automatically released when this limit is exceeded.

sampleRate

number

default:"16000"

Audio sample rate in Hz (whisper.cpp requires 16000 Hz)

Methods

addAudioData()

Adds audio data to the current slice. Automatically creates new slices when capacity is reached.

sliceManager.addAudioData(audioData: Uint8Array): { slice?: AudioSlice }

audioData

Uint8Array

required

Raw audio data to add (16-bit PCM format, 2 bytes per sample)

slice

AudioSlice

The current audio slice being built

The method automatically:

Creates a new slice if none exists
Appends data to the current slice
Finalizes and creates a new slice when capacity (80% full) is reached
Cleans up old slices when memory limit is exceeded

getSliceForTranscription()

Returns the next unprocessed slice ready for transcription.

sliceManager.getSliceForTranscription(): AudioSlice | null

AudioSlice | null

The next slice to transcribe, or null if no unprocessed slices are available

getAudioDataForTranscription()

Retrrieves audio data for a specific slice index.

sliceManager.getAudioDataForTranscription(sliceIndex: number): Uint8Array | null

sliceIndex

number

required

Index of the slice to retrieve audio data from

Uint8Array | null

Audio data for the specified slice, or null if slice not found or has no data

getSliceByIndex()

Retrieves a specific slice by its index.

sliceManager.getSliceByIndex(sliceIndex: number): AudioSlice | null

sliceIndex

number

required

Index of the slice to retrieve

AudioSlice | null

The requested slice, or null if not found

markSliceAsProcessed()

Marks a slice as processed after transcription.

sliceManager.markSliceAsProcessed(sliceIndex: number): void

sliceIndex

number

required

Index of the slice to mark as processed

moveToNextTranscribeSlice()

Moves the internal transcription pointer to the next slice.

sliceManager.moveToNextTranscribeSlice(): void

forceNextSlice()

Forces finalization of the current slice and moves to the next one, regardless of capacity.

sliceManager.forceNextSlice(): { slice?: AudioSlice }

slice

AudioSlice

The finalized slice, if it contains data

Useful for manually triggering slice finalization (e.g., on speech_end events).

getMemoryUsage()

Returns current memory usage statistics.

sliceManager.getMemoryUsage(): MemoryUsage

slicesInMemory

number

Number of active slices in memory (not released)

totalSamples

number

Total number of audio samples across all active slices

estimatedMB

number

Estimated memory usage in megabytes (rounded to 2 decimal places)

getCurrentSliceInfo()

Returns information about the current slice state.

sliceManager.getCurrentSliceInfo(): SliceInfo

currentSliceIndex

number

Index of the slice currently being built

transcribeSliceIndex

number

Index of the next slice to be transcribed

totalSlices

number

Total number of slices in memory

memoryUsage

MemoryUsage

Current memory usage statistics

reset()

Resets the slice manager, releasing all slices and resetting indices.

sliceManager.reset(): void

This:

Releases all slices (clears audio data)
Resets slice indices to 0
Clears the internal slice array

Types

AudioSlice

index

number

Unique index of the slice

data

Uint8Array

Raw audio data (16-bit PCM)

sampleCount

number

Number of bytes in the audio data

startTime

number

Timestamp when slice was created (milliseconds)

endTime

number

Timestamp of last data added to slice (milliseconds)

isProcessed

boolean

Whether the slice has been transcribed

isReleased

boolean

Whether the slice has been released from memory

MemoryUsage

slicesInMemory

number

Number of active slices

totalSamples

number

Total audio samples

estimatedMB

number

Estimated memory in MB

Example Usage

import { SliceManager } from 'whisper.rn'

// Create a slice manager with 30-second slices, keeping max 3 in memory
const sliceManager = new SliceManager(30, 3, 16000)

// Add audio data (e.g., from microphone)
const { slice } = sliceManager.addAudioData(audioChunk)

// Get slice for transcription
const sliceToTranscribe = sliceManager.getSliceForTranscription()

if (sliceToTranscribe) {
  const audioData = sliceManager.getAudioDataForTranscription(
    sliceToTranscribe.index
  )
  
  // Transcribe the audio data...
  // ...
  
  // Mark as processed
  sliceManager.markSliceAsProcessed(sliceToTranscribe.index)
  sliceManager.moveToNextTranscribeSlice()
}

// Check memory usage
const memUsage = sliceManager.getMemoryUsage()
console.log(`Memory: ${memUsage.estimatedMB} MB, ${memUsage.slicesInMemory} slices`)

// Force next slice (e.g., on speech_end event)
const { slice: finalizedSlice } = sliceManager.forceNextSlice()

// Get current state
const info = sliceManager.getCurrentSliceInfo()
console.log(`Current slice: ${info.currentSliceIndex}`)

// Reset when done
sliceManager.reset()

Memory Management Strategy

SliceManager uses a circular buffer approach:

Slice Creation: New slices are created when audio data is added
Auto-Finalization: Slices are finalized when they reach 80% capacity
Memory Limit: When maxSlicesInMemory is exceeded, oldest slices are released
Data Cleanup: Released slices have their audio data cleared to free memory
Slice Retention: Only the most recent maxSlicesInMemory slices are kept

This ensures:

Predictable memory usage
No memory leaks from unbounded audio buffers
Access to recent audio history for context
Automatic garbage collection of old data

Core API

Voice Activity Detection

Realtime Transcription

Types & Interfaces

Utilities

Overview

Constructor

Parameters

Methods

addAudioData()

getSliceForTranscription()

getAudioDataForTranscription()

getSliceByIndex()

markSliceAsProcessed()

moveToNextTranscribeSlice()

forceNextSlice()

getMemoryUsage()

getCurrentSliceInfo()

reset()

Types

AudioSlice

MemoryUsage

Example Usage

Memory Management Strategy

Build docs developers (and LLMs) love

Core API

Voice Activity Detection

Realtime Transcription

Types & Interfaces

Utilities

​Overview

​Constructor

​Parameters

​Methods

​addAudioData()

​getSliceForTranscription()

​getAudioDataForTranscription()

​getSliceByIndex()

​markSliceAsProcessed()

​moveToNextTranscribeSlice()

​forceNextSlice()

​getMemoryUsage()

​getCurrentSliceInfo()

​reset()

​Types

​AudioSlice

​MemoryUsage

​Example Usage

​Memory Management Strategy

Build docs developers (and LLMs) love

Overview

Constructor

Parameters

Methods

addAudioData()

getSliceForTranscription()

getAudioDataForTranscription()

getSliceByIndex()

markSliceAsProcessed()

moveToNextTranscribeSlice()

forceNextSlice()

getMemoryUsage()

getCurrentSliceInfo()

reset()

Types

AudioSlice

MemoryUsage

Example Usage

Memory Management Strategy