CPU transcription is 5-10x slower than GPU mode. A 5-minute video may take 3-5 minutes to transcribe on CPU vs. 30 seconds on GPU.
System Requirements
Minimum Requirements
Minimum Requirements
- CPU: Modern multi-core processor (Intel i5/AMD Ryzen 5 or better)
- RAM: 8GB minimum, 16GB recommended
- Python: 3.10 or higher
- Storage: 2GB for dependencies + space for video files
- OS: Windows 10/11, Ubuntu 20.04+, or macOS 11+
Installation by Platform
- Windows
- Linux
- macOS
Windows CPU Installation
Install FFmpeg
Option 1: Using Chocolatey (Recommended)Option 2: Using ScoopOption 3: Manual Installation
- Download from ffmpeg.org/download.html
- Extract to
C:\ffmpeg - Add
C:\ffmpeg\binto system PATH
Install ImageMagick
C:\Program Files\ImageMagick-7.x.x-Q16-HDRI\config\policy.xmlFind:Verify Installation
Confirm CPU mode is active:CPU Requirements File
Therequirements-cpu.txt file excludes all NVIDIA CUDA packages that are in the standard requirements.txt. Key differences:
What's Different in requirements-cpu.txt
What's Different in requirements-cpu.txt
Excluded packages:
- All
nvidia-*packages (CUDA runtime, cuDNN, etc.) - GPU-specific PyTorch wheels
- CPU-only versions of core dependencies
faster-whisper==1.0.1- CPU-compatible transcriptiontorch(installed separately from CPU index)- All other standard dependencies (OpenCV, MoviePy, LangChain, etc.)
Full Requirements List
Therequirements-cpu.txt includes:
PyTorch must be installed separately from the CPU index before installing
requirements-cpu.txt.Performance Comparison
Typical processing times for a 5-minute YouTube video:| Task | GPU (CUDA) | CPU |
|---|---|---|
| Download | 10-30s | 10-30s |
| Audio extraction | 2-5s | 2-5s |
| Transcription | 30-60s | 3-5 min |
| Highlight selection | 5-10s | 5-10s |
| Video processing | 20-40s | 25-50s |
| Total | 1-2 min | 4-7 min |
Running in CPU Mode
Usage is identical to GPU mode:Interactive Mode
Command-Line Mode
Batch Processing
The application automatically detects CPU mode and adjusts accordingly. No code changes are needed.
Optimization Tips
Close Background Applications
Free up RAM by closing unnecessary programs, especially browsers with many tabs.
Process Shorter Videos First
Start with 2-5 minute videos to get faster results. The highlight selection works best with 5-15 minute source videos anyway.
Switching Between CPU and GPU
- CPU → GPU
- GPU → CPU
Troubleshooting CPU Installation
Issue: Still Using GPU Libraries
If you see CUDA-related errors after CPU installation:Issue: Out of Memory Errors
If transcription fails with memory errors:- Close other applications to free RAM
- Process shorter video segments (< 5 minutes)
- Increase system swap space (Linux) or virtual memory (Windows)
Issue: Very Slow Performance
If CPU transcription is taking excessively long:- Check CPU usage: Should be near 100% during transcription
- Disable power saving: Ensure your system isn’t throttling CPU
- Consider alternatives: The AI Clipping API offers faster cloud processing
When to Use CPU Mode
CPU mode is ideal for:- 💻 Laptops without dedicated GPUs
- 🖥️ Desktop systems with AMD or Intel graphics
- 🍎 Apple Silicon Macs (M1/M2/M3)
- 📦 Minimal installation footprint (~800MB vs. ~4GB)
- 🧪 Testing and development
- ⚡ Frequent video processing
- 📹 Long-form content (15-30 minute videos)
- 🏭 Batch processing many videos
- ⏱️ Time-sensitive workflows
All features work identically in both modes. The only difference is processing speed.
Alternative Solutions
If CPU performance is insufficient:Cloud Processing Options
Cloud Processing Options
-
AI Clipping API: muapi.ai/playground/ai-clipping
- No installation required
- Faster processing
- Better highlight selection algorithms
-
Google Colab: Run the tool on free GPU instances
- Requires adapting the code for notebook format
- Free tier has usage limits
-
Cloud VM with GPU: AWS, GCP, or Azure instances
- Rent by the hour
- Use the standard GPU installation
