Endpoint
This is a long-running operation that can take several minutes. Use the Progress endpoint to track generation status in real-time.
Request Body
The topic or subject for the presentation. This will be used to generate content, images, and animations.Example:
"Introduction to Machine Learning"The number of slides to generate in the presentation.Default:
5The language for voice narration.Default:
"english"The tone of the presentation narration.Default:
"formal"Response
Status of the generation request. Returns
"success" when completed.Human-readable message describing the result.
The generated presentation content structure including slides, titles, and content.Contains:
title: Presentation titleslides: Array of slide objects with titles, content, and visual specifications
The generated narration scripts with timestamps for each slide.Contains:
slide_scripts: Array of scripts withstart_time,end_time, andnarration_texttotal_duration: Total video duration in seconds
Full file system path to the generated video file.
The filename of the generated video. Use this with the Video endpoint to stream the video.
Generation Process
The endpoint performs the following steps:- Content Generation (10%): Creates presentation structure and slide content
- Script Generation (20%): Generates narration scripts with timestamps
- Audio Generation (30-49%): Generates voice narration for each slide
- Visual Generation (50-80%): Creates slides with text, images, or animations
- Video Composition (85-95%): Combines slides and audio into final video
- Completion (100%): Returns video information
Track progress in real-time using the Progress endpoint with the generation ID derived from the sanitized topic name.
Example Request
Example Response
Error Responses
500 Internal Server Error
Returned when video generation fails:Generation ID
The generation ID is derived from the topic by:- Taking the first 30 characters
- Replacing spaces with underscores
- Removing special characters (
:,/,\,",',?,!)
"Introduction to Machine Learning" becomes "Introduction_to_Machine_Learn"
This ID is used to track progress via the SSE endpoint.
Visual Types
Slides can have three types of visuals (mutually exclusive):- Animation: Generated using Manim for mathematical/technical concepts
- Image: Fetched from external sources based on keywords
- Text-only: Default slide with styled text content
Each slide will have only ONE visual type. If both animation and image are flagged, animation takes priority.