Preprocess options

PreprocessOptions controls how Tafrigh preprocesses your audio before splitting and transcription. Proper preprocessing can dramatically improve transcription accuracy by reducing background noise, isolating voice frequencies, and enhancing speech clarity.

Why preprocessing matters

Audio preprocessing improves transcription accuracy by:

Reducing noise: Removes background hiss, hum, and environmental sounds
Isolating speech: Filters out frequencies outside the human voice range
Enhancing clarity: Boosts mid-range frequencies where speech is most prominent
Normalizing volume: Ensures consistent audio levels across the file

Configuration

You pass preprocessOptions to the transcribe() function:

import { transcribe } from 'tafrigh';

const transcript = await transcribe('audio.mp3', {
  preprocessOptions: {
    noiseReduction: {
      afftdnStart: 1,
      afftdnStop: 1,
      afftdn_nf: -25,
      dialogueEnhance: true,
      lowpass: 1,
      highpass: 200,
    },
  },
});

Properties

noiseReduction

object | null

Controls noise reduction and audio filtering. Set to null to skip preprocessing entirely.

All individual filter properties can be set to null to omit that specific filter while keeping others active.

Show properties

highpass

number | null

default:"300"

Frequency in Hz for the high-pass filter.Attenuates frequencies below this cutoff, removing low-frequency noise like rumble, hum, or wind. Human speech fundamental frequencies typically range from 85 Hz (male) to 255 Hz (female).Set to null to disable the high-pass filter entirely.Typical values:

Male voices: 100 to 200 Hz
Female voices: 150 to 300 Hz
General use: 300 Hz (default)

lowpass

number | null

default:"3000"

Frequency in Hz for the low-pass filter.Allows frequencies below this cutoff while attenuating higher frequencies. Removes high-frequency noise like hiss or electronic interference. Speech intelligibility frequencies are typically below 4000 Hz.Set to null to disable the low-pass filter entirely.Typical values:

Telephone quality: 3400 Hz
General speech: 3000 Hz (default)
High fidelity: 4000 to 5000 Hz

afftdnStart

number | null

default:"0"

Time in seconds to begin FFT-based noise reduction.This marks the start of the noise profile sampling period. The denoiser analyzes this portion of audio to learn what “noise” sounds like. Must be used together with afftdnStop.Set to null to disable FFT-based denoising entirely.

The audio between afftdnStart and afftdnStop should contain only noise, no speech. Choose a segment with background noise only.

afftdnStop

number | null

default:"1.5"

Time in seconds to end FFT-based noise reduction sampling.This marks the end of the noise profile sampling period. The denoiser uses the audio between afftdnStart and afftdnStop to build a noise profile. Must be used together with afftdnStart.Set to null to disable FFT-based denoising entirely.

afftdn_nf

number | null

default:"-20"

Noise floor parameter in dB for FFT-based denoising.Controls the threshold for what the denoiser considers “noise.” Lower values (more negative) are more aggressive at removing noise but may affect speech quality.Set to null to disable FFT-based denoising entirely.Typical values:

Light noise reduction: -15 to -10 dB
Moderate noise reduction: -20 to -25 dB (default: -20 dB)
Aggressive noise reduction: -30 to -40 dB

dialogueEnhance

boolean

default:"true"

Enable dialogue enhancement to boost speech clarity.Enhances mid-range frequencies (typically 1-4 kHz) where human speech is most prominent. Makes dialogue easier to understand and improves transcription accuracy.Set to false to disable dialogue enhancement.

Examples

Disable preprocessing completely

Skip all preprocessing when your audio is already clean:

const transcript = await transcribe('clean-audio.mp3', {
  preprocessOptions: {
    noiseReduction: null,
  },
});

Light noise reduction

For moderately clean recordings:

const transcript = await transcribe('podcast.mp3', {
  preprocessOptions: {
    noiseReduction: {
      highpass: 200,
      lowpass: 3500,
      afftdnStart: 0,
      afftdnStop: 0.5,
      afftdn_nf: -15,
      dialogueEnhance: true,
    },
  },
});

Aggressive noise reduction

For noisy environments or low-quality recordings:

const transcript = await transcribe('noisy-recording.mp3', {
  preprocessOptions: {
    noiseReduction: {
      highpass: 250,
      lowpass: 3000,
      afftdnStart: 0.5,
      afftdnStop: 2.5,
      afftdn_nf: -35,
      dialogueEnhance: true,
    },
  },
});

Male voice optimization

Optimized for lower-pitched male voices:

const transcript = await transcribe('male-speaker.mp3', {
  preprocessOptions: {
    noiseReduction: {
      highpass: 100,  // Lower to preserve male fundamentals
      lowpass: 3000,
      afftdnStart: 0,
      afftdnStop: 1,
      afftdn_nf: -20,
      dialogueEnhance: true,
    },
  },
});

Female voice optimization

Optimized for higher-pitched female voices:

const transcript = await transcribe('female-speaker.mp3', {
  preprocessOptions: {
    noiseReduction: {
      highpass: 200,  // Higher to remove more low-frequency noise
      lowpass: 4000,  // Higher to preserve more harmonics
      afftdnStart: 0,
      afftdnStop: 1,
      afftdn_nf: -20,
      dialogueEnhance: true,
    },
  },
});

Selective filtering

Use only specific filters by setting others to null:

const transcript = await transcribe('audio.mp3', {
  preprocessOptions: {
    noiseReduction: {
      highpass: 300,
      lowpass: null,           // Disable low-pass filter
      afftdnStart: null,       // Disable FFT denoising
      afftdnStop: null,
      afftdn_nf: null,
      dialogueEnhance: true,
    },
  },
});

Preprocessing callbacks

Monitor preprocessing progress with callbacks:

const transcript = await transcribe('audio.mp3', {
  preprocessOptions: {
    noiseReduction: {
      // ... your settings
    },
  },
  callbacks: {
    onPreprocessingStarted: async (filePath) => {
      console.log(`Starting preprocessing: ${filePath}`);
    },
    onPreprocessingProgress: async (percent) => {
      console.log(`Preprocessing: ${percent}% complete`);
    },
    onPreprocessingFinished: async (filePath) => {
      console.log(`Finished preprocessing: ${filePath}`);
    },
  },
});

Finding the right noise sample

For FFT-based denoising (afftdnStart, afftdnStop, afftdn_nf), you need a portion of your audio that contains only background noise:

Listen to your audio and find a segment with no speech
Note the timestamp where noise-only audio begins
Set the duration to capture 0.5-3 seconds of noise
Configure the parameters:

const transcript = await transcribe('audio.mp3', {
  preprocessOptions: {
    noiseReduction: {
      afftdnStart: 5.0,    // Noise sample starts at 5 seconds
      afftdnStop: 7.0,     // Noise sample ends at 7 seconds
      afftdn_nf: -25,      // Moderate noise reduction
      // ... other settings
    },
  },
});

If your audio has no clean noise-only segments, set afftdnStart, afftdnStop, and afftdn_nf to null and rely on the highpass/lowpass filters instead.

Understanding filter interactions

Preprocessing filters are applied in sequence:

High-pass filter: Removes low frequencies (rumble, hum)
Low-pass filter: Removes high frequencies (hiss, artifacts)
FFT denoising: Learns and removes noise profile
Dialogue enhancement: Boosts speech frequencies
Normalization: Ensures consistent volume levels

Each filter builds on the previous one to progressively improve audio quality.

Over-aggressive filtering can remove important speech information. Start with default values and adjust incrementally based on transcription results.

Noise reduction

Deep dive into noise reduction options

Split options

Configure audio chunk splitting

Core Functions

Types

Configuration

Preprocess options

Why preprocessing matters

Configuration

Properties

Examples

Disable preprocessing completely

Light noise reduction

Aggressive noise reduction

Male voice optimization

Female voice optimization

Selective filtering

Preprocessing callbacks

Finding the right noise sample

Understanding filter interactions

Noise reduction

Split options

Build docs developers (and LLMs) love

Core Functions

Types

Configuration

​Why preprocessing matters

​Configuration

​Properties

​Examples

​Disable preprocessing completely

​Light noise reduction

​Aggressive noise reduction

​Male voice optimization

​Female voice optimization

​Selective filtering

​Preprocessing callbacks

​Finding the right noise sample

​Understanding filter interactions

​Related

Noise reduction

Split options

Build docs developers (and LLMs) love

Why preprocessing matters

Configuration

Properties

Examples

Disable preprocessing completely

Light noise reduction

Aggressive noise reduction

Male voice optimization

Female voice optimization

Selective filtering

Preprocessing callbacks

Finding the right noise sample

Understanding filter interactions

Related