Available Video Models
realesr-animevideov3
Optimized for anime videos with XS size
realesr-general-x4v3
Compact model for general video content
realesr-animevideov3
This model is specifically optimized for anime videos with extra-small (XS) size for efficient video processing.
Model Specifications
| Property | Value |
|---|---|
| Scale | 1x, 2x, 3x, or 4x (variable) |
| Architecture | SRVGGNetCompact |
| Convolution Layers | 16 |
| Features | 64 |
| Size | XS (extra small) |
| Activation | PReLU |
| Download | realesr-animevideov3.pth |
Key Features
Temporal Consistency
Designed to maintain consistency across video frames
Lightweight
XS size enables fast processing of video sequences
Variable Scale
Supports 1x, 2x, 3x, and 4x upscaling
Low Memory
Consumes minimal GPU memory for longer videos
realesr-general-x4v3
Model Specifications
| Property | Value |
|---|---|
| Scale | 1x, 2x, 3x, or 4x (variable) |
| Architecture | SRVGGNetCompact |
| Convolution Layers | 32 |
| Features | 64 |
| Size | S (small) |
| Activation | PReLU |
| Download | realesr-general-x4v3.pth |
This model can be used for both general images and videos. It’s slightly larger than the anime video model but still very efficient.
Usage
PyTorch Inference
Command Line Options
Number of processes per GPU. Total processes = num_gpu × num_process_per_gpu.
Helps with GPU utilization since video processing is often IO-bound.
Extract all frames before processing. Enable this if you encounter ffmpeg errors with multi-processing.
Output scale: 1, 2, 3, or 4. Both video models support variable scaling.
Input video file path.
Model name:
realesr-animevideov3 or realesr-general-x4v3.NCNN Executable (Manual Workflow)
For systems without Python or CUDA, use the NCNN portable executable with a manual frame extraction workflow.Get Original FPS
Check the original video’s FPS:Look for the fps value in the output (e.g., “23.98 fps”).
FFmpeg Options Explained
Frame Extraction Options
Frame Extraction Options
Video Encoding Options
Video Encoding Options
Audio Handling
Audio Handling
Model Comparison
Anime Video vs General Video
| Feature | realesr-animevideov3 | realesr-general-x4v3 |
|---|---|---|
| Size | XS | S |
| Conv Layers | 16 | 32 |
| Speed | Fastest | Fast |
| Best For | Anime videos | General videos |
| Memory | Minimal | Low |
| Quality | Good for anime | Good for general content |
Video Models vs Image Models
| Aspect | Video Models | Image Models |
|---|---|---|
| Architecture | SRVGGNetCompact | RRDBNet |
| Size | XS/S | Large |
| Speed | Very fast | Slower |
| Memory | Low | Higher |
| Quality | Good | Best |
| Use Case | Video sequences | Single images |
Video models trade some quality for speed and efficiency, making them practical for processing thousands of video frames.
Performance Optimization
Multi-Processing for Better GPU Utilization
Why Multi-Processing?
Why Multi-Processing?
Video processing is often IO-bound (reading/writing frames), leaving GPUs underutilized. Multi-processing helps maximize GPU usage:Monitor GPU memory and adjust
num_process_per_gpu accordingly.Handling FFmpeg Errors
Extract Frames First
Extract Frames First
If you encounter ffmpeg errors with multi-processing:This extracts all frames before processing, avoiding concurrent ffmpeg access issues.
Best Practices
Choose Right Scale
Start with 2x for faster processing, use 4x only if needed:
Optimize GPU Usage
Use multi-processing for better GPU utilization:
Preserve Audio
Always include audio when merging frames back:
Match FPS
Use original video’s FPS for smooth playback:
Example Workflows
Quick 2x Upscaling (Anime)
High Quality 4x Upscaling (General)
Batch Processing Multiple Videos
Next Steps
Anime Models
Learn about anime image upscaling
General Models
Explore general image models