What is whisper.rn?
whisper.rn is a React Native binding for whisper.cpp, bringing OpenAI’s powerful Whisper automatic speech recognition (ASR) model to iOS and Android devices with high-performance native inference.
On-Device Processing
All transcription happens locally on the device - no cloud APIs, no internet required, complete privacy
Production Ready
Battle-tested in production apps with optimized performance for mobile devices
Hardware Acceleration
Supports Metal GPU acceleration and Core ML encoder acceleration on iOS
Realtime Transcription
Built-in support for live microphone transcription with Voice Activity Detection
Key Features
Multiple Transcription Modes
- File Transcription: Process audio files (WAV, base64 encoded audio)
- Realtime Transcription: Live microphone input with VAD auto-slicing
- Data Transcription: Direct PCM data processing via JSI (zero-copy ArrayBuffer)
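To illustrate what "zero-copy" means on the JavaScript side, the sketch below contrasts a typed-array view, which shares memory with its underlying `ArrayBuffer`, with `slice()`, which copies the bytes. It uses only standard JavaScript APIs and does not call whisper.rn. The 16-bit sample format and all variable names are assumptions chosen for illustration; check the API Reference for the format the library actually expects.

```typescript
// Allocate room for four 16-bit samples (sample format chosen only for illustration).
const pcmBuffer = new ArrayBuffer(4 * Int16Array.BYTES_PER_ELEMENT);
const writer = new Int16Array(pcmBuffer);
writer.set([100, 200, 300, 400]);

// A second view over the same buffer shares memory: nothing is copied.
const sharedView = new Int16Array(pcmBuffer);

// slice() allocates a new buffer and copies the bytes into it.
const copiedView = new Int16Array(pcmBuffer.slice(0));

// A write through the first view is visible through the shared view,
// but not through the copy.
writer[0] = -1;
```

Handing over the buffer itself, rather than a base64 string or a sliced copy, is what avoids duplicating audio data in memory.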
Voice Activity Detection (VAD)
Detect speech segments using the Silero VAD model with configurable presets:
- default, sensitive, very-sensitive
- conservative, very-conservative
- continuous, meeting, noisy
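When a preset name comes from user settings or remote configuration, it can help to validate it before use. The following is a minimal sketch that encodes the preset names listed above as a TypeScript union. `VAD_PRESETS`, `VadPreset`, `isVadPreset`, and `resolveVadPreset` are hypothetical names for this example, not whisper.rn exports.

```typescript
// Preset names copied from the list above. The constant and type names
// are hypothetical helpers, not part of the whisper.rn API.
const VAD_PRESETS = [
  "default",
  "sensitive",
  "very-sensitive",
  "conservative",
  "very-conservative",
  "continuous",
  "meeting",
  "noisy",
] as const;

type VadPreset = (typeof VAD_PRESETS)[number];

// Type guard: narrows an arbitrary string to a known preset name.
function isVadPreset(value: string): value is VadPreset {
  return (VAD_PRESETS as readonly string[]).indexOf(value) !== -1;
}

// Falls back to "default" when the input is missing or unrecognized.
function resolveVadPreset(value: string | undefined): VadPreset {
  return value !== undefined && isVadPreset(value) ? value : "default";
}
```

Falling back to `default` instead of throwing is a design choice for this sketch; an app that wants to surface configuration mistakes might prefer to raise an error.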
Platform Optimizations
iOS:
- Core ML encoder acceleration (iOS 15.0+)
- Metal GPU acceleration
- Pre-built xcframework for faster builds
- Flash Attention support
Android:
- Optimized NDK builds
- 16KB page size support (Android 15+)
- ARM NEON optimizations
Model Support
All Whisper model sizes supported:
- tiny, tiny.en (39-75 MB)
- base, base.en (142-148 MB)
- small, small.en (466-488 MB)
- medium, medium.en (1.5 GB)
- large-v1, large-v2, large-v3 (2.9 GB)
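Since model size drives download time and storage use, an app may want to choose a model family from a storage budget. The sketch below uses the upper-bound sizes listed above. `pickLargestModel` and the lookup table are hypothetical helpers, not part of whisper.rn, and 1 GB is treated as 1000 MB for simplicity.

```typescript
// Approximate upper-bound sizes in MB, taken from the model list above.
// Assumption: 1 GB is treated as 1000 MB.
type ModelFamily = "tiny" | "base" | "small" | "medium" | "large";

const MODEL_SIZE_MB: Record<ModelFamily, number> = {
  tiny: 75,
  base: 148,
  small: 488,
  medium: 1500,
  large: 2900,
};

// Families ordered from smallest to largest.
const MODEL_FAMILIES: ModelFamily[] = ["tiny", "base", "small", "medium", "large"];

// Hypothetical helper: returns the largest family whose size fits the
// budget, or null when even the smallest model is too large.
function pickLargestModel(budgetMb: number): ModelFamily | null {
  let choice: ModelFamily | null = null;
  for (const family of MODEL_FAMILIES) {
    if (MODEL_SIZE_MB[family] <= budgetMb) {
      choice = family;
    }
  }
  return choice;
}
```

For example, a 500 MB budget selects `small`. Storage is only one constraint: larger models also need more memory and run more slowly on the same device.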
Performance
On an iPhone 13 Pro Max, the tiny.en model with Core ML processes a 30-second audio clip in approximately 800 ms in release mode.
Performance depends on:
- Device CPU/GPU capabilities
- Model size (tiny/base/small/medium/large)
- Audio duration and complexity
- Whether Core ML/GPU acceleration is enabled
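One way to compare results across devices and models is the real-time factor: processing time divided by audio duration. The sketch below applies it to the iPhone 13 Pro Max figure quoted above. `realTimeFactor` is a hypothetical helper, not a whisper.rn function.

```typescript
// Real-time factor (RTF): processing time divided by audio duration.
// Values below 1 mean transcription runs faster than real time.
function realTimeFactor(processingMs: number, audioSeconds: number): number {
  if (audioSeconds <= 0) {
    throw new RangeError("audioSeconds must be positive");
  }
  return processingMs / 1000 / audioSeconds;
}

// Figures from the benchmark above: about 800 ms for a 30-second clip.
const benchmarkRtf = realTimeFactor(800, 30); // roughly 0.027
const benchmarkSpeedup = 1 / benchmarkRtf; // roughly 37.5x faster than real time
```

Measuring the same ratio on a target device with the intended model gives a quick check of whether realtime transcription is feasible there.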
Quick Navigation
Installation
Get whisper.rn installed in your React Native app with platform-specific setup
Quick Start
Transcribe your first audio file in under 5 minutes
API Reference
Complete API documentation for all features
Examples
Real-world examples and use cases
Use Cases
- Voice Notes: Transcribe voice memos and notes
- Accessibility: Real-time captions for video content
- Meeting Transcription: Record and transcribe meetings
- Language Learning: Transcribe practice conversations
- Voice Commands: Build custom voice-controlled interfaces
- Subtitles: Generate subtitles for video content
Community
whisper.rn is open source and built by BRICKS.
GitHub Repository
Star, fork, and contribute on GitHub
Report Issues
Found a bug? Let us know