Overview
The Image-to-Video API provides access to 60+ models that animate static images into dynamic videos. These models automatically activate when you upload a start frame image in the Video Studio.Image-to-video models support single or multi-image input (up to 9 images for some models) for character consistency and scene composition.
Model Categories
Multi-Image Animation Models
Multi-Image Animation Models
Advanced models that accept multiple reference images for consistent character animation.
- Seedance 2.0 I2V — Up to 9 images, 5s/10s/15s duration
- Openai Sora 2 / 2 Pro I2V — Multiple images with audio support
- Pixverse v4.5 / v5 I2V — Multi-image with 5s/8s options
- Vidu v2.0 I2V — Multiple images for 4s mobile videos
- Vidu Q1 Reference — Multi-image character animation
- Seedance Lite Reference Video — Multi-image reference-to-video
- Wan2.1 Reference Video — Product and object animation
High-Quality Animation Models
High-Quality Animation Models
Premium models for photorealistic and cinematic animation.
- Kling v2.1 Master/Pro I2V — 5s/10s photorealistic motion
- Kling v2.5/2.6 Pro I2V — Enhanced character animation
- Veo3 / Veo3 Fast I2V — Google’s image-to-video models
- Veo3.1 / Veo3.1 Fast I2V — 8s 1080p animation
- Runway Act Two I2V — Professional cinematic effects
- Leonardo AI Motion 2.0 — Slow-motion and dynamic effects
Seedance Animation Models
Seedance Animation Models
ByteDance’s animation models with quality and resolution controls.
- Seedance Lite I2V — Fast 3–12s generation
- Seedance Pro I2V — High-quality cinematic animation
- Seedance 2.0 I2V — Latest with 15s support and multi-image
- Seedance Lite Reference Video — Multi-image product animation
Wan Animation Models
Wan Animation Models
Realistic character and portrait animation.
- Wan2.2 I2V — 5s/8s portrait animation
- Wan2.5 I2V / I2V Fast — Enhanced motion quality
- Wan2.6 I2V — Multi-duration support (5s/10s/15s)
- Wan2.1 Reference Video — Multi-image product shots
Specialized Animation Models
Specialized Animation Models
Domain-specific and effect-based animation tools.
- Minimax Hailuo 02 Standard/Pro I2V — 6s/10s animation
- OVI I2V — Creative experimental animation
- Higgsfield Dop I2V — 100+ motion presets (Bullet Time, Dolly, etc.)
- Video Effects — 40+ animated effects (Flying, Melt, Minecraft, etc.)
Popular Models
| Model | Max Images | Duration | Resolution | Best For |
|---|---|---|---|---|
| Seedance 2.0 I2V | 9 | 5s / 10s / 15s | Basic/High | Multi-image character consistency |
| Kling v2.6 Pro I2V | 1 | 5s / 10s | Variable | Photorealistic single-image animation |
| Veo3.1 I2V | 1 | 8s | 1080p | Google-grade quality |
| Sora 2 Pro I2V | Multiple | 10s / 15s / 25s | 720p/1080p | Long-form storytelling with audio |
| Runway Act Two I2V | 1 | Variable | Variable | Professional cinematic effects |
| Pixverse v5 I2V | Multiple | 5s / 8s | 360p–1080p | Flexible multi-image animation |
| Wan2.6 I2V | 1 | 5s / 10s / 15s | Variable | Portrait and character animation |
Newly Added Models
Seedance 2.0 I2V
Family: ByteDanceMax Images: 9
Duration: 5s / 10s / 15s
Quality: Basic / High
Aspect Ratios: 16:9, 9:16, 4:3, 3:4
Features
Features
- Animate up to 9 reference images for character consistency
- Quality presets for speed vs. fidelity tradeoff
- Natural motion and physics simulation
- Wide aspect ratio support
- Seamless multi-image blending
Example Request
Seedance 2.0 I2V can accept up to 9 images to maintain character consistency across complex animations.
Multi-Image Animation
Models with multi-image support maintain character appearance and style consistency across the animation.
Multi-Image Model Support
| Model | Max Images | Use Case |
|---|---|---|
| Seedance 2.0 I2V | 9 | Character consistency across motion |
| Seedance Lite Reference Video | Multiple | Product and object animation |
| Openai Sora 2 / 2 Pro I2V | Multiple | Narrative with character consistency |
| Pixverse v4.5 / v5 I2V | Multiple | Flexible multi-reference animation |
| Vidu v2.0 I2V | Multiple | Mobile-optimized character animation |
| Vidu Q1 Reference | Multiple | Cinematic character scenes |
| Wan2.1 Reference Video | Multiple | Product showcase animation |
Example: Multi-Image Animation Request
Common Input Parameters
Image URL / Images List
Type:string (single) or array (multi-image)Required: Yes
Format: Publicly accessible HTTPS URL or Muapi uploaded file URL For single-image models, use
image_url. For multi-image models, use images_list array.
Prompt
Type:stringRequired: Varies by model (optional for some motion preset models)
Max Length: 500–3000 characters Describe the desired motion, camera movement, and changes in the scene. Be specific about what should animate and how. Example Prompts:
Duration
Type:integerOptions: Varies by model (typically 4–25 seconds) Length of the animated video. Some models have fixed durations (e.g., Vidu v2.0 = 4s), while others offer flexible options.
Aspect Ratio
Type:stringOptions:
16:9, 9:16, 1:1, 4:3, 3:4, 21:9, etc.
Some models auto-detect from the input image; others require explicit specification.
Resolution / Quality
- Resolution:
480p,720p,1080p,360p,540p - Quality:
basic,medium,high
Higher resolutions and quality settings consume more credits and take longer to generate.
Model-Specific Features
Higgsfield Dop I2V — 100+ Motion Presets
Higgsfield Dop I2V — 100+ Motion Presets
Select from curated motion and camera effects:Camera Movements:
- 360 Orbit, 3D Rotation, Arc Left/Right
- Dolly In/Out/Left/Right, Dolly Zoom In/Out
- Crane Up/Down/Over The Head
- Crash Zoom In/Out, Zoom In/Out
- Bullet Time, Datamosh, Disintegration
- Fire Breathe, Eyes In, Face Punch
- Clone Explosion, Building Explosion
- Action Run, Baseball Kick, Basketball Dunks
- Boxing, Catwalk, Car Chasing
Video Effects — 40+ Animated Effects
Video Effects — 40+ Animated Effects
Apply preset animations without prompts:
- Balloon Flyaway, Blow Kiss, Body Shake
- Break Glass, Carry Me, Cartoon Doll
- Flying, Gender Swap, Hair Swap
- Minecraft, Pixel Me, Soul Depart
- Zoom In Fast, Zoom Out
Seedance Models — Camera Fixed Option
Seedance Models — Camera Fixed Option
Camera Fixed:
true | falseWhen enabled, locks the camera position and animates only the subject/scene.OVI I2V — Audio Captioning
OVI I2V — Audio Captioning
Supports audio description in prompts:Use
<S>...<E> for speech and <AUDCAP>...<ENDAUDCAP> for ambient audio.Example Use Cases
Portrait Animation
Portrait Animation
Product Showcase
Product Showcase
Character Consistency Animation
Character Consistency Animation
Cinematic Effect
Cinematic Effect
Duration & Resolution Guide
| Use Case | Recommended Duration | Resolution | Models |
|---|---|---|---|
| Social media clips | 4–8s | 720p–1080p | Wan 2.6, Kling v2.6 Pro, Seedance 2.0 |
| YouTube shorts | 10–15s | 1080p | Sora 2 Pro, Seedance 2.0, Wan 2.6 |
| Product demos | 5–8s | 1080p | Runway Act Two, Veo 3.1, Kling v2.6 Pro |
| Character animation | 10–15s | 720p–1080p | Seedance 2.0 I2V, Sora 2 Pro |
| Quick effects | 4–6s | 720p | Video Effects, Higgsfield Dop |
API Endpoints
All image-to-video models use the unified endpoint pattern:Model Selection Tips
Best Practices
- Upload First: Use the Upload File endpoint to host images before animating.
- Start Frame Quality: Use high-quality, well-lit images for best results.
- Be Specific: Describe motion, camera movement, and what should remain static.
- Test Duration: Start with 5s generations before committing to longer durations.
- Camera Fixed: Use
camera_fixed: truefor subject-only animation without camera movement. - Multi-Image Order: For multi-image models, sequence matters—order images logically.
- Resolution Trade-offs: Use 720p for iteration, 1080p for final output.
Rate Limits
Image-to-video generation consumes more credits than text-to-video through the Muapi.ai API. Cost factors include:- Duration: Longer videos cost more
- Resolution: Higher resolutions cost more
- Quality: High-quality modes cost more
- Multi-Image: Multiple input images may cost more
- Model: Premium models cost more than open-source options
