Overview
The Text-to-Video API provides access to 40+ cutting-edge models that generate videos from text descriptions. From cinematic sequences to animated content, these models cover a wide range of video generation use cases.
Text-to-video models are automatically selected when no start frame image is provided in the Video Studio.
Model Categories
Seedance Models
ByteDance’s video generation models with quality and resolution controls.
- Seedance Lite — Fast generation for quick prototyping
- Seedance Pro / Pro Fast — High-quality cinematic generation
- Seedance v1.5 Pro / Pro Fast — Enhanced motion and detail
- Seedance 2.0 — Latest generation with 15s duration support
- Seedance 2.0 Extend — Seamlessly continue existing videos
Kling Models
Kuaishou’s photorealistic video generation models.
- Kling v2.1 Master — High-fidelity cinematic rendering
- Kling v2.5 Turbo Pro — Fast generation with quality
- Kling v2.6 Pro — 5s/10s duration with ultra-realism
- Kling O1 Pro — Latest generation with improved coherence
- Kling v3.0 Pro / Standard — Next-gen detail and motion
Proprietary Models
Industry-leading commercial video generation models.
- Veo 3 / 3 Fast — Google’s video generation model
- Veo 3.1 / 3.1 Fast — Enhanced 8s generation at 1080p
- Sora / Sora 2 / Sora 2 Pro — OpenAI’s flagship models (10–25s)
- Runway Gen-3 — Slow-motion and cinematic effects
- Hailuo 02 Standard / Pro — MiniMax 6s/10s generation
- Hailuo 2.3 Standard / Pro — Latest MiniMax models
Wan Models
Realistic character animation and motion models.
- Wan 2.1 / 2.2 / 2.2 Fast — Portrait and character animation
- Wan 2.5 / 2.5 Fast — Enhanced motion quality
- Wan 2.6 — 5s/10s/15s duration support
Specialized Models
Domain-specific and open-source video models.
- Hunyuan / Hunyuan Fast — Tencent’s multilingual model
- Pixverse v4.5 / v5 / v5.5 — Anime and illustration styles
- Vidu v2.0 — 4s 9:16 mobile-optimized videos
- OVI — Creative and experimental generation
- Grok Imagine — xAI’s video model with 6s/10s options
- LTX 2 Pro / Fast / 19B — Open-source with long duration (20s)
Popular Models
| Model | Duration | Resolution | Aspect Ratios | Best For |
|---|---|---|---|---|
| Seedance 2.0 | 5s / 10s / 15s | Basic/High | 16:9, 9:16, 4:3, 3:4 | Production-ready cinematic videos |
| Kling v3.0 Pro | 5s | Variable | 16:9, 9:16, 1:1 | Photorealistic motion |
| Veo 3.1 | 8s | 1080p | 16:9, 9:16 | Google-grade video quality |
| Sora 2 Pro | 10s / 15s / 25s | 720p/1080p | 16:9, 9:16 | Long-form storytelling |
| Runway Gen-3 | 5s / 8s | 720p/1080p | 16:9, 9:16, 1:1, 4:3, 3:4 | Cinematic slow-motion |
| Wan 2.6 | 5s / 10s / 15s | 720p/1080p | 16:9, 9:16 | Character animation |
| LTX 2 Fast | 6–20s | 720p | 16:9, 9:16 | Open-source long videos |
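The table above can be turned into a small lookup for picking a model by duration and aspect ratio. This is a minimal sketch: the dictionary layout and the `models_supporting` helper are illustrative names, not part of the API. The values are transcribed from this page, with Runway Gen-3's ratios and LTX 2 Fast's 2-second steps taken from the model-specific sections.

```python
# Durations (seconds) and aspect ratios transcribed from the Popular Models table.
# The structure and helper name are illustrative assumptions, not an official schema.
POPULAR_MODELS = {
    "Seedance 2.0": {"durations": [5, 10, 15], "aspect_ratios": ["16:9", "9:16", "4:3", "3:4"]},
    "Kling v3.0 Pro": {"durations": [5], "aspect_ratios": ["16:9", "9:16", "1:1"]},
    "Veo 3.1": {"durations": [8], "aspect_ratios": ["16:9", "9:16"]},
    "Sora 2 Pro": {"durations": [10, 15, 25], "aspect_ratios": ["16:9", "9:16"]},
    # Ratios listed in the Runway Gen-3 section of this page.
    "Runway Gen-3": {"durations": [5, 8], "aspect_ratios": ["16:9", "9:16", "1:1", "4:3", "3:4"]},
    "Wan 2.6": {"durations": [5, 10, 15], "aspect_ratios": ["16:9", "9:16"]},
    # 6 to 20 seconds in 2-second increments, per the LTX 2 section.
    "LTX 2 Fast": {"durations": list(range(6, 21, 2)), "aspect_ratios": ["16:9", "9:16"]},
}


def models_supporting(duration, aspect_ratio):
    """Return the listed models that offer both the duration and the aspect ratio."""
    return [
        name
        for name, spec in POPULAR_MODELS.items()
        if duration in spec["durations"] and aspect_ratio in spec["aspect_ratios"]
    ]


print(models_supporting(15, "9:16"))
print(models_supporting(8, "1:1"))
```

A lookup like this only reflects the seven models in the table; the full catalog would need its own data source.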
Newly Added Models
Seedance 2.0
Family: ByteDance
Duration: 5s / 10s / 15s
Quality: Basic / High
Aspect Ratios: 16:9, 9:16, 4:3, 3:4
Features
- ByteDance’s latest video generation model
- Up to 15 seconds of high-quality video
- Quality presets for speed vs. fidelity tradeoff
- Wide aspect ratio support
- Natural motion and physics simulation
Example Request
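The original request sample is not reproduced on this page, so the following is only a sketch of how a Seedance 2.0 request body might be assembled and checked against the options listed above. The field names (`prompt`, `aspect_ratio`, `duration`, `quality`) and the helper `build_seedance_2_request` are assumptions for illustration; the endpoint and exact schema should be taken from the API reference.

```python
import json

# Options listed for Seedance 2.0 on this page.
SEEDANCE_2_DURATIONS = (5, 10, 15)
SEEDANCE_2_QUALITIES = ("basic", "high")
SEEDANCE_2_ASPECT_RATIOS = ("16:9", "9:16", "4:3", "3:4")


def build_seedance_2_request(prompt, aspect_ratio="16:9", duration=5, quality="basic"):
    """Build a request body dict. Field names are assumed, not confirmed by the API."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if aspect_ratio not in SEEDANCE_2_ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio}")
    if duration not in SEEDANCE_2_DURATIONS:
        raise ValueError(f"unsupported duration: {duration}")
    if quality not in SEEDANCE_2_QUALITIES:
        raise ValueError(f"unsupported quality: {quality}")
    return {
        "prompt": prompt,
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "quality": quality,
    }


body = build_seedance_2_request(
    "A slow dolly in on a lighthouse at dusk, waves breaking below",
    aspect_ratio="16:9",
    duration=10,
    quality="high",
)
print(json.dumps(body, indent=2))
```

Validating locally before sending avoids spending credits on a request the model would reject.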
Seedance 2.0 Extend
Family: ByteDance
Duration: 5s / 10s / 15s
Quality: Basic / High
Requires: request_id from original Seedance 2.0 generation
Features
- Seamlessly continue any Seedance 2.0 video
- Preserves style, motion, and audio continuity
- Optional continuation prompt for guided extension
- Same quality and duration options as base model
Example Request
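The original request sample is missing here as well. As a rough sketch, an Extend request body could carry the `request_id` of the source generation plus an optional continuation prompt. Apart from `request_id`, which this page names, the field names and the `build_extend_request` helper are assumptions, and the placeholder ID is not a real value.

```python
def build_extend_request(request_id, prompt=None, duration=5, quality="basic"):
    """Build an Extend request body. Only `request_id` is named by the docs;
    the other field names are assumptions for illustration."""
    if not request_id:
        raise ValueError("request_id from the original Seedance 2.0 generation is required")
    if duration not in (5, 10, 15):
        raise ValueError(f"unsupported duration: {duration}")
    if quality not in ("basic", "high"):
        raise ValueError(f"unsupported quality: {quality}")
    body = {"request_id": request_id, "duration": duration, "quality": quality}
    # The continuation prompt is optional; omit the field entirely when unset.
    if prompt:
        body["prompt"] = prompt
    return body


# Placeholder ID: substitute the value returned by your original generation.
extend_body = build_extend_request(
    "REQUEST_ID_FROM_ORIGINAL_GENERATION",
    prompt="The camera continues rising above the lighthouse",
    duration=10,
)
print(extend_body)
```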
Seedance 2.0 Extend requires the request_id from a previously generated Seedance 2.0 video. The extension will match the style and motion of the original.
Common Input Parameters
Prompt
Type: string
Required: Yes
Max Length: Varies by model (typically 500–3000 characters)
Detailed text description of the video. Include camera movements, subject actions, lighting, atmosphere, and style. Example:
Aspect Ratio
Type: string
Options: Varies by model (typically 16:9, 9:16, 1:1, 4:3, 3:4, 21:9)
Defines the width-to-height ratio of the generated video. Most models support 16:9 (landscape) and 9:16 (vertical).
Duration
Type: integer
Options: Varies by model (typically 4–25 seconds)
Length of the generated video in seconds. Longer durations consume more credits and take longer to generate.
Some models have fixed durations (e.g., Vidu v2.0 only supports 4s), while others offer flexible options (e.g., Wan 2.6 supports 5s/10s/15s).
Resolution
Type: string
Options: 480p, 720p, 1080p, 768P, etc.
Output video resolution. Higher resolutions provide better quality but consume more credits.
Quality
Type: string
Options: basic, medium, high
Some models (like Seedance and Wan) offer quality presets that affect generation time and fidelity.
Model-Specific Features
Sora 2 Pro — Long-Form Generation
Durations: 10s, 15s, 25s
Resolutions: 720p, 1080p
Aspect Ratios: 16:9, 9:16
Best for narrative storytelling and complex scene transitions.
25-second videos only support 720p resolution.
Runway Gen-3 — Cinematic Effects
Durations: 5s, 8s
Resolutions: 720p, 1080p
Aspect Ratios: 16:9, 9:16, 1:1, 4:3, 3:4
Excels at slow-motion, cinematic camera work, and dramatic lighting.
8-second videos cannot use 1080p resolution. 1080p videos cannot be 8 seconds.
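The duration and resolution restrictions stated for Sora 2 Pro and Runway Gen-3 can be checked before a request is sent. The sketch below encodes only the rules written on this page; the `MODEL_RULES` table and `validate` function are illustrative names, not part of the API.

```python
# Rules transcribed from the Sora 2 Pro and Runway Gen-3 sections above.
MODEL_RULES = {
    "Sora 2 Pro": {
        "durations": {10, 15, 25},
        "resolutions": {"720p", "1080p"},
        "forbidden": {(25, "1080p")},  # 25-second videos only support 720p
    },
    "Runway Gen-3": {
        "durations": {5, 8},
        "resolutions": {"720p", "1080p"},
        "forbidden": {(8, "1080p")},  # 8-second videos cannot use 1080p
    },
}


def validate(model, duration, resolution):
    """Return a list of human-readable problems; an empty list means the combination is allowed."""
    rules = MODEL_RULES[model]
    errors = []
    if duration not in rules["durations"]:
        errors.append(f"{model} does not offer {duration}s")
    if resolution not in rules["resolutions"]:
        errors.append(f"{model} does not offer {resolution}")
    if (duration, resolution) in rules["forbidden"]:
        errors.append(f"{model} cannot combine {duration}s with {resolution}")
    return errors


print(validate("Sora 2 Pro", 25, "1080p"))
print(validate("Runway Gen-3", 5, "1080p"))
```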
LTX 2 Models — Long Duration Open Source
- LTX 2 Fast: 6–20 seconds (2s increments)
- LTX 2 Pro: 6, 8, 10 seconds
- LTX 2 19B: 5+ seconds with resolution control (480p/720p/1080p)
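Because LTX 2 Fast and LTX 2 Pro accept only specific durations, a requested length may need to be snapped to an allowed value. A minimal sketch, assuming you want the closest allowed duration and prefer the shorter one on a tie (`nearest_allowed` is a hypothetical helper). LTX 2 19B is left out because the page gives no upper bound for it.

```python
# Allowed durations in seconds, from the list above.
LTX_2_FAST_DURATIONS = list(range(6, 21, 2))  # 6, 8, 10, ..., 20
LTX_2_PRO_DURATIONS = [6, 8, 10]


def nearest_allowed(target, allowed):
    """Pick the allowed duration closest to target; ties resolve to the shorter one."""
    return min(allowed, key=lambda d: (abs(d - target), d))


print(nearest_allowed(13, LTX_2_FAST_DURATIONS))
print(nearest_allowed(25, LTX_2_FAST_DURATIONS))
print(nearest_allowed(9, LTX_2_PRO_DURATIONS))
```

Preferring the shorter duration on a tie keeps credit usage lower, which matches the cost notes elsewhere on this page.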
Seedance 2.0 Extend — Video Continuation
Requires: request_id from original Seedance 2.0 generation
Optional: Continuation prompt
The model seamlessly continues the original video, preserving style and motion. If no prompt is provided, it continues the scene naturally.
Example Prompts
Cinematic Drone Shot
Character Action
Product Demo
Fantasy Scene
Duration & Resolution Guide
| Use Case | Recommended Duration | Resolution | Models |
|---|---|---|---|
| Social media clips | 5–8s | 720p–1080p | Seedance 2.0, Wan 2.6, Kling v2.6 Pro |
| YouTube shorts | 10–15s | 1080p | Sora 2 Pro, Veo 3.1, Seedance 2.0 |
| Product demos | 5–8s | 1080p | Runway Gen-3, Veo 3.1, Kling v2.6 Pro |
| Cinematic sequences | 10–25s | 720p–1080p | Sora 2 Pro, Seedance 2.0 |
| Quick prototypes | 4–6s | 480p–720p | Hailuo 2.3, Wan 2.2 Fast |
API Endpoints
All text-to-video models use the unified endpoint pattern:
Model Selection Tips
Camera Movement Vocabulary
Include these terms in your prompt to guide camera motion:
- Dolly In/Out — Move camera toward/away from subject
- Pan Left/Right — Horizontal rotation
- Tilt Up/Down — Vertical rotation
- Zoom In/Out — Lens zoom (not camera movement)
- Crane Shot — Vertical movement (up/down)
- Tracking Shot — Follow subject’s movement
- Orbit/Arc — Circular movement around subject
- Dutch Angle — Tilted camera for dramatic effect
- Slow dolly — Smooth, cinematic
- Crash zoom — Fast, dramatic zoom
- Smooth tracking — Steadicam-like follow
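One way to apply the vocabulary above is to assemble prompts from named parts so that camera movement, subject, action, and lighting are always present. This is only an illustrative sketch: `compose_prompt` is a hypothetical helper, and the 500-character default reflects the low end of the "typically 500–3000 characters" range mentioned for the prompt parameter, not a confirmed limit for any model.

```python
def compose_prompt(camera, subject, action, lighting, style="", max_chars=500):
    """Join prompt parts into one sentence and enforce a length budget.

    max_chars is an assumed conservative default; actual limits vary by model.
    """
    clauses = [f"{camera} of {subject} {action}", lighting, style]
    prompt = ", ".join(c.strip() for c in clauses if c.strip()) + "."
    if len(prompt) > max_chars:
        raise ValueError(f"prompt is {len(prompt)} characters, limit is {max_chars}")
    return prompt


prompt = compose_prompt(
    camera="Smooth tracking shot",
    subject="a red fox",
    action="running through fresh snow",
    lighting="golden hour backlight",
    style="cinematic, shallow depth of field",
)
print(prompt)
```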
Rate Limits
Video generation consumes significantly more credits than image generation through the Muapi.ai API. Cost factors include:
- Duration: Longer videos cost more
- Resolution: Higher resolutions cost more
- Quality: High-quality modes cost more
- Model: Premium models (Sora, Runway) cost more than open-source options
Best Practices
- Start Short: Test with 5s generations before committing to longer durations.
- Be Specific: Describe camera movement, subject actions, and lighting in detail.
- Avoid Complex Edits: Text-to-video models work best with single continuous shots.
- Use Extend: For longer sequences, use Seedance 2.0 Extend to chain multiple clips.
- Resolution Trade-offs: Use 720p for iteration, 1080p for final output.
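The "Use Extend" tip can be planned ahead of time: given a target length, decide how many clips to chain and how long each should be. The sketch below assumes each Extend call adds one segment of 5, 10, or 15 seconds to the running total, which this page does not state explicitly; `plan_segments` is a hypothetical helper.

```python
def plan_segments(target_seconds, allowed=(5, 10, 15)):
    """Split a target length into allowed segment durations.

    Uses the longest segment while it fits, then the shortest segment that
    covers the remainder, so the total may slightly exceed the target.
    The first entry is the base generation; the rest are Extend calls.
    """
    segments = []
    remaining = target_seconds
    longest = max(allowed)
    while remaining > 0:
        if remaining >= longest:
            segment = longest
        else:
            segment = min(d for d in allowed if d >= remaining)
        segments.append(segment)
        remaining -= segment
    return segments


print(plan_segments(40))
print(plan_segments(32))
```

Fewer, longer segments mean fewer seams between clips, at the cost of less opportunity to steer the scene with a new continuation prompt.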
