Available models
Qwen3-ASR
Multilingual ASR supporting 30+ languages with 30-50x real-time speed
Paraformer
Non-autoregressive Chinese ASR with 18x+ real-time speed
FunASR-Nano
LLM-based 800M-parameter model supporting 31 languages
Performance comparison
| Model | Languages | Speed | Architecture | Parameters |
|---|---|---|---|---|
| Qwen3-ASR-1.7B | 30+ languages | 30x RT | Encoder-decoder | 1.7B |
| Qwen3-ASR-0.6B | 30+ languages | 22x RT | Encoder-decoder | 0.6B |
| Paraformer | Chinese | 18-75x RT | Non-autoregressive | 220M |
| FunASR-Nano | 31 languages | ~10x RT | LLM-based | 800M |
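To make the "x RT" column concrete: a real-time factor of N means roughly N seconds of audio are transcribed per second of wall-clock time. The helper below is a minimal sketch of that arithmetic. The function name is hypothetical and not part of the project's API, and real throughput will vary with hardware and audio length.

```rust
/// Estimate wall-clock seconds needed to transcribe `audio_secs` of audio
/// at a given real-time factor (for example 30.0 for "30x RT").
/// Hypothetical helper for illustration only; not part of the library.
fn estimated_processing_secs(audio_secs: f64, rt_factor: f64) -> Option<f64> {
    // Reject non-positive or NaN factors and negative or NaN durations.
    if !(rt_factor > 0.0) || !(audio_secs >= 0.0) {
        return None;
    }
    Some(audio_secs / rt_factor)
}
```

Under this estimate, one hour of audio (3600 s) at 30x RT takes about 120 s, and about 200 s at 18x RT.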
Key features
Pure Rust implementation
All models are implemented in Rust with zero Python dependencies at runtime:
- Native Metal GPU acceleration via MLX
- Efficient memory management
- Cross-platform binary distribution
- Direct integration into Rust applications
Optimized for Apple Silicon
- Metal GPU acceleration for neural network operations
- Accelerate framework for audio processing (FFT, resampling)
- 8-bit quantization support for reduced memory usage
- Efficient batch processing for long-form audio
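As a rough illustration of why 8-bit quantization reduces memory usage, the sketch below estimates the storage needed for model weights alone. It assumes a 16-bit unquantized baseline, which this page does not state, and it ignores activations, quantization scales, and runtime buffers. The function name is hypothetical.

```rust
/// Approximate bytes needed to store `params` weights at `bits_per_weight`.
/// Back-of-the-envelope estimate only: ignores quantization scales,
/// activations, and any runtime buffers. Hypothetical helper.
fn approx_weight_bytes(params: u64, bits_per_weight: u32) -> u64 {
    // Round up to a whole number of bytes.
    (params * bits_per_weight as u64 + 7) / 8
}
```

For the 1.7B-parameter model this gives about 3.4 GB at 16 bits and about 1.7 GB at 8 bits, so the weights take roughly half the space under these assumptions.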
Production-ready API
The unified API server provides OpenAI-compatible endpoints.
Architecture overview
Qwen3-ASR architecture
Paraformer architecture
FunASR-Nano architecture
Supported audio formats
All models support:
- WAV - Native support (any sample rate, mono/stereo)
- MP3, M4A, FLAC, OGG, AAC - Automatic conversion via ffmpeg
- Raw samples - Direct `f32` array input at 16 kHz
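If you want to supply raw samples yourself, the audio has to end up as mono `f32` at 16 kHz. The following is a minimal sketch of that preparation using only the standard library. Both function names are hypothetical, and the linear-interpolation resampler is a simple stand-in with no anti-aliasing filter, not the Accelerate-based resampling the project uses.

```rust
/// Downmix interleaved 16-bit PCM to mono f32 in the range [-1.0, 1.0).
/// Hypothetical helper; trailing samples that do not fill a frame are dropped.
fn to_mono_f32(interleaved: &[i16], channels: usize) -> Vec<f32> {
    assert!(channels > 0, "channel count must be non-zero");
    interleaved
        .chunks_exact(channels)
        .map(|frame| {
            // Scale each sample to [-1.0, 1.0), then average the channels.
            let sum: f32 = frame.iter().map(|&s| s as f32 / 32768.0).sum();
            sum / channels as f32
        })
        .collect()
}

/// Resample by linear interpolation (no low-pass filtering, so downsampling
/// may alias). Hypothetical helper for illustration.
fn resample_linear(input: &[f32], from_hz: u32, to_hz: u32) -> Vec<f32> {
    if input.is_empty() || from_hz == 0 || to_hz == 0 {
        return Vec::new();
    }
    if from_hz == to_hz {
        return input.to_vec();
    }
    let out_len = (input.len() as u64 * to_hz as u64 / from_hz as u64) as usize;
    let step = from_hz as f64 / to_hz as f64;
    let last = input.len() - 1;
    (0..out_len)
        .map(|i| {
            // Position of output sample `i` on the input timeline.
            let pos = i as f64 * step;
            let idx = pos as usize;
            let frac = (pos - idx as f64) as f32;
            let a = input[idx.min(last)];
            let b = input[(idx + 1).min(last)];
            a + (b - a) * frac
        })
        .collect()
}
```

A typical call chain would be `resample_linear(&to_mono_f32(&pcm, 2), 48_000, 16_000)` for 48 kHz stereo input. For production use, a filtered resampler is the safer choice.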
Model selection guide
Choose Qwen3-ASR when you need:
- Multilingual support (30+ languages)
- Best accuracy on Chinese, English, Japanese, Korean
- Long-form audio transcription (automatic 30s chunking)
- A production-grade balance of quality and speed
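The 30-second chunking mentioned above can be pictured as splitting the sample buffer into fixed windows. This is a minimal sketch assuming 16 kHz mono input and non-overlapping windows. The actual chunking strategy (overlap, silence-aware boundaries) is not described on this page, and the names below are hypothetical.

```rust
/// Sample rate assumed for raw input (16 kHz, as for raw `f32` samples).
const SAMPLE_RATE_HZ: usize = 16_000;
/// Window length in seconds, matching the 30 s chunking described above.
const CHUNK_SECS: usize = 30;

/// Split a mono sample buffer into consecutive windows of at most 30 s.
/// The final window may be shorter. Hypothetical helper: fixed,
/// non-overlapping windows are an assumption of this sketch.
fn chunk_30s(samples: &[f32]) -> Vec<&[f32]> {
    samples.chunks(SAMPLE_RATE_HZ * CHUNK_SECS).collect()
}
```

With this scheme, 70 seconds of audio yields three windows: two full 30-second windows and one 10-second remainder.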
Choose Paraformer when you need:
- Chinese-only transcription
- Maximum speed (non-autoregressive)
- Lower memory footprint
- Extremely fast inference for short audio
Choose FunASR-Nano when you need:
- Support for 31 languages, including dialects
- Far-field/noisy environment robustness
- Regional accent recognition
- LLM-based semantic understanding
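The guide above can be encoded as a small decision helper. The sketch below is one possible reading of it, not part of the project's API. The type names, the fields, and the priority order (robustness and dialect needs first, then Chinese-only speed, then the multilingual default) are all assumptions.

```rust
/// The three model families described on this page.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Model {
    Qwen3Asr,
    Paraformer,
    FunAsrNano,
}

/// Hypothetical summary of an application's requirements.
#[derive(Debug, Clone, Copy, Default)]
struct Needs {
    chinese_only: bool,
    noisy_or_far_field: bool,
    dialects_or_accents: bool,
}

/// Pick a model following the selection guide. The priority order is an
/// assumption of this sketch, not a rule stated by the project.
fn pick_model(needs: Needs) -> Model {
    if needs.noisy_or_far_field || needs.dialects_or_accents {
        // Far-field or noisy audio, dialects, and regional accents.
        Model::FunAsrNano
    } else if needs.chinese_only {
        // Chinese-only transcription at maximum speed.
        Model::Paraformer
    } else {
        // Multilingual default with long-form support.
        Model::Qwen3Asr
    }
}
```

For example, `pick_model(Needs { chinese_only: true, ..Default::default() })` selects Paraformer, while the default `Needs` falls through to Qwen3-ASR.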
Quick start
Next steps
Qwen3-ASR
Learn about the multilingual Qwen3-ASR models
Paraformer
Explore the high-speed Paraformer Chinese ASR
FunASR-Nano
Discover the LLM-based FunASR-Nano
API Reference
View the unified API documentation