Other STT Models
This page covers additional STT model types supported by react-native-sherpa-onnx, including specialized and emerging architectures.Overview
WeNet CTC
Compact CTC models from WeNet framework
SenseVoice
Multilingual with emotion detection and punctuation
FunASR Nano
LLM-based ASR with prompt customization
Moonshine
Modern streaming-capable lightweight ASR
Fire Red ASR
Encoder-decoder ASR models
Dolphin
Single-model CTC for compact deployment
Canary
NeMo multilingual model
Omnilingual
Wide language coverage CTC model
MedASR
Medical ASR for healthcare applications
Telespeech CTC
Telephony-optimized CTC model
Tone CTC
Ultra-lightweight streaming CTC (t-one)
WeNet CTC
modelType: 'wenet_ctc'
Description
CTC models from the WeNet framework, designed for compact deployment.Characteristics
- Streaming: ❌ No (offline only)
- Speed: ⭐⭐⭐⭐⭐ Very Fast
- Size: Small (compact models)
- Languages: Limited (depends on model variant)
Configuration
Download
WeNet CTC ModelsModel Detection
- Folder name should contain
wenet - Files:
model.onnx,tokens.txt
SenseVoice
modelType: 'sense_voice'
Description
Multilingual model with emotion detection and automatic punctuation. Excellent for applications requiring sentiment analysis.Characteristics
- Streaming: ❌ No
- Accuracy: ⭐⭐⭐⭐
- Languages: Chinese, English, Cantonese, Japanese, Korean
- Special: Emotion labels + punctuation
Configuration
Language Helpers
Download
SenseVoice ModelsModel Detection
- Folder name should contain
senseorsensevoice
FunASR Nano
modelType: 'funasr_nano'
Description
Lightweight LLM-based ASR with customizable system/user prompts. Supports advanced decoding options.Characteristics
- Streaming: ❌ No
- Special: LLM-based with prompt engineering
- Languages: Chinese, English, Japanese (depends on variant)
Configuration
Language Helpers
Download
FunASR Nano ModelsModel Detection
- Folder name should contain
funasrorfunasr-nano - Files:
encoder_adaptor,llm,embedding, tokenizer directory
Moonshine
modelType: 'moonshine' (v1) or 'moonshine_v2' (v2)
Description
Modern streaming-capable ASR with two architecture versions. Moonshine v1: Four-part architecture (preprocess, encode, uncached/cached decode)Moonshine v2: Two-part architecture (encoder + merged decoder)
Characteristics
- Streaming: ✅ Yes (both v1 and v2)
- Speed: ⭐⭐⭐⭐
- Languages: Limited (check model variant)
Configuration
Download
Moonshine ModelsModel Detection
- Folder name should contain
moonshine - V1:
preprocess.onnx,encode.onnx,uncached_decode.onnx,cached_decode.onnx - V2:
encoder.onnxorencoder.ort, merged decoder
Fire Red ASR
modelType: 'fire_red_asr'
Description
Encoder-decoder ASR models from the Fire Red project.Characteristics
- Streaming: ❌ No
- Speed: ⭐⭐⭐
- Languages: Limited (depends on variant)
Configuration
Download
Fire Red ASR ModelsModel Detection
- Folder name should contain
fire_redorfire-red - Files: encoder, decoder directories
Dolphin
modelType: 'dolphin'
Description
Single-model CTC for compact deployment.Characteristics
- Streaming: ❌ No
- Speed: ⭐⭐⭐⭐⭐
- Size: Very Small
- Languages: Limited
Configuration
Download
Dolphin ModelsModel Detection
- Folder name should contain
dolphin - Files:
model.onnx,tokens.txt
Canary
modelType: 'canary'
Description
NeMo Canary multilingual model with source/target language configuration.Characteristics
- Streaming: ❌ No
- Multilingual: ✅ Yes (English, Spanish, German, French)
- Accuracy: ⭐⭐⭐⭐
Configuration
Language Helpers
Download
Canary ModelsModel Detection
- Folder name should contain
canary
Omnilingual
modelType: 'omnilingual'
Description
Omnilingual CTC model with wide language coverage.Characteristics
- Streaming: ❌ No
- Multilingual: ✅ Yes (many languages)
- Speed: ⭐⭐⭐
Configuration
Download
Omnilingual ModelsModel Detection
- Folder name should contain
omnilingual
MedASR
modelType: 'medasr'
Description
Medical ASR CTC model optimized for healthcare terminology.Characteristics
- Streaming: ❌ No
- Domain: Medical/Healthcare
- Speed: ⭐⭐⭐⭐
Configuration
Model Detection
- Folder name should contain
medasr
Telespeech CTC
modelType: 'telespeech_ctc'
Description
Telespeech CTC model optimized for telephony audio.Characteristics
- Streaming: ❌ No
- Domain: Telephony (8kHz audio)
- Speed: ⭐⭐⭐⭐
Configuration
Download
Telespeech ModelsModel Detection
- Folder name should contain
telespeech
Tone CTC (t-one)
modelType: 'tone_ctc'
Description
Ultra-lightweight streaming CTC model (t-one). Excellent for resource-constrained devices.Characteristics
- Streaming: ✅ Yes
- Speed: ⭐⭐⭐⭐⭐ Very Fast
- Size: Very Small
- Memory: ⭐⭐⭐⭐⭐ Very Low
Configuration
Download
Tone CTC ModelsModel Detection
- Folder name should contain
t-one,t_one, or the wordtone(as standalone word) - Files:
model.onnx,tokens.txt
Comparison Table
| Model | Streaming | Multilingual | Speed | Special Feature |
|---|---|---|---|---|
| WeNet CTC | ❌ | Limited | Very Fast | Compact |
| SenseVoice | ❌ | 5 langs | Medium | Emotion + punctuation |
| FunASR Nano | ❌ | Limited | Medium | LLM-based with prompts |
| Moonshine | ✅ | Limited | Fast | Modern streaming |
| Fire Red ASR | ❌ | Limited | Medium | Encoder-decoder |
| Dolphin | ❌ | Limited | Very Fast | Ultra-compact |
| Canary | ❌ | 4 langs | Medium | NeMo multilingual |
| Omnilingual | ❌ | Many | Medium | Wide coverage |
| MedASR | ❌ | English | Fast | Medical domain |
| Telespeech | ❌ | Limited | Fast | Telephony (8kHz) |
| Tone CTC | ✅ | Limited | Very Fast | Ultra-lightweight |
Choosing a Specialized Model
For Emotion Detection
- SenseVoice – Provides emotion labels in result
For Medical/Healthcare
- MedASR – Optimized for medical terminology
For Telephony
- Telespeech CTC – Designed for 8kHz phone audio
For Low-End Devices
- Tone CTC – Ultra-lightweight streaming
- Dolphin – Very small offline model
- WeNet CTC – Compact deployment
For LLM-Based Flexibility
- FunASR Nano – Prompt engineering for ASR
For Modern Streaming
- Moonshine – Latest streaming architecture
- Tone CTC – Lightweight streaming
Next Steps
STT Overview
Compare all STT model types
STT API
Detailed API documentation
Streaming STT
Real-time recognition guide
Model Setup
How to download and bundle models