Prerequisites
Before you begin, ensure you have the following installed:
- Python 3.8+ with pip
- Node.js 16+ with npm
- FFmpeg (for video processing)
- Git (to clone the repository)
You also need API keys from the following services:
- Google AI Studio (for the Gemini API)
- Sarvam AI (for text-to-speech)
- Unsplash (for images)
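To confirm that the command-line prerequisites are visible on your PATH, a small check like the following may help. It is a minimal sketch: the command names in `REQUIRED_TOOLS` are assumptions (your interpreter may be called `python` rather than `python3`), and it only checks that each tool exists, not that it meets the minimum version.

```python
import shutil

# Assumed command names for each prerequisite; adjust for your platform.
REQUIRED_TOOLS = ["python3", "pip", "node", "npm", "ffmpeg", "git"]


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the tools from `tools` that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All command-line prerequisites were found.")
```

The `which` parameter exists only so the lookup can be swapped out when testing; in normal use the default `shutil.which` is enough.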
Get Started in 5 Minutes
Set up the backend
Navigate to the backend directory and create a virtual environment.
Activate the virtual environment.
Install dependencies.
Install Manim (animation library).
Configure environment variables
Create a .env file in the backend/ directory.
Edit .env and add your API keys.
The backend uses Sarvam AI for text-to-speech. Make sure to get your API key from Sarvam AI.
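Before starting the server, it can be useful to verify that every key has a value. The sketch below parses a .env file and reports missing or empty entries. The variable names in `REQUIRED_KEYS` are hypothetical placeholders, since the names the backend actually reads are not given here; replace them with the real ones. The script assumes it is run from the repository root.

```python
from pathlib import Path

# Hypothetical variable names: replace with the names the backend actually reads.
REQUIRED_KEYS = ["GEMINI_API_KEY", "SARVAM_API_KEY", "UNSPLASH_ACCESS_KEY"]


def parse_env(text):
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("\"'")
    return values


def missing_keys(text, required=REQUIRED_KEYS):
    """Return the required keys that are absent or empty in the .env text."""
    values = parse_env(text)
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    env_path = Path("backend/.env")  # assumes the repository root as working directory
    if not env_path.exists():
        print(f"{env_path} does not exist yet.")
    else:
        print("Missing or empty keys:", missing_keys(env_path.read_text()) or "none")
```

This only confirms that a value is present. It cannot tell whether a key is valid; an invalid key will surface later as an API error during generation.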
Start the backend server
With your virtual environment still activated, run the backend server. The FastAPI backend will start on http://localhost:8000.
Set up the frontend
Open a new terminal and navigate to the frontend directory.
Install dependencies.
Start the development server.
The React frontend will start on http://localhost:5173.
Generate your first presentation
- Open your browser and go to http://localhost:5173
- Enter a topic (e.g., “Introduction to Machine Learning”)
- Set the number of slides (e.g., 5)
- Choose a language (English, Hindi, Kannada, or Telugu)
- Select a tone (Formal, Casual, or Educational)
- Click Generate
The system will then:
- Generate presentation content using Gemini AI
- Create narration scripts for each slide
- Generate voice audio using Sarvam AI
- Fetch relevant images from Unsplash or create animations with Manim
- Compose everything into a final MP4 video
The generation process takes 2-5 minutes depending on the number of slides and complexity.
What’s Next?
Installation Guide
Detailed installation instructions for all platforms
Configuration
Learn about advanced configuration options
API Reference
Explore the FastAPI endpoints
Creating Presentations
Learn how to create and customize presentations
Troubleshooting
Backend server won't start
Make sure:
- Your virtual environment is activated
- All dependencies are installed: pip install -r requirements.txt
- FFmpeg is installed and accessible in your PATH
- Port 8000 is not already in use
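One way to check the last item is to try connecting to the port. The sketch below assumes the backend listens on 127.0.0.1; the helper name `port_in_use` is illustrative and not part of the project.

```python
import socket


def port_in_use(port, host="127.0.0.1"):
    """Return True if something is already accepting connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(1.0)
        # connect_ex returns 0 when the connection succeeds, i.e. a listener exists.
        return sock.connect_ex((host, port)) == 0


if __name__ == "__main__":
    state = "already in use" if port_in_use(8000) else "free"
    print(f"Port 8000 is {state}.")
```

If the port is reported as in use while the backend is stopped, another process is holding it and needs to be stopped first.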
Frontend can't connect to backend
- Verify the backend is running on http://localhost:8000
- Check the browser console for CORS errors
- Ensure your .env file has the correct API keys
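The first check can be scripted with the standard library. This is a minimal sketch: it requests the root URL, which is an assumption (the backend may not define a route there), so any HTTP response, including an error status, is treated as proof that the server is running. It does not detect CORS problems, which only show up in the browser.

```python
import urllib.error
import urllib.request


def backend_reachable(url="http://localhost:8000", timeout=3.0):
    """Return True if an HTTP server answers at `url`, whatever the status code."""
    # Bypass any system proxy settings: the backend runs on the local machine.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied with an error status (for example 404), so it is running.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or name resolution failure.
        return False


if __name__ == "__main__":
    print("Backend reachable:", backend_reachable())
```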
Video generation fails
Common issues:
- Invalid API keys in .env
- FFmpeg not installed or not in PATH
- Insufficient disk space in backend/outputs/
- API rate limits exceeded
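To rule out the disk space cause, you can query the free space on the filesystem that holds the outputs folder. The sketch below assumes it is run from the repository root; the 1 GiB threshold is an arbitrary illustration, not a documented requirement.

```python
import shutil
from pathlib import Path


def free_gib(path):
    """Return free space, in GiB, on the filesystem that holds `path`."""
    return shutil.disk_usage(path).free / 1024**3


def has_enough_space(path, required_gib=1.0):
    """Return True if at least `required_gib` GiB are free (threshold is an assumption)."""
    return free_gib(path) >= required_gib


if __name__ == "__main__":
    outputs = Path("backend/outputs")
    # Fall back to the current directory if the outputs folder does not exist yet.
    target = outputs if outputs.exists() else Path(".")
    print(f"Free space near {target}: {free_gib(target):.1f} GiB")
    print("Enough space:", has_enough_space(target))
```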