Overview
DecipherIt transforms your research into engaging podcast-style audio overviews. Using CrewAI agents and LemonFox AI text-to-speech, it creates natural conversations between two hosts discussing your research findings.
Audio overviews are generated on demand and are typically ready in 2-4 minutes. The voice and content are AI-generated and may contain inaccuracies or audio glitches.
How It Works
Content Preparation
All research content is retrieved from the vector database.
Content Retrieval:
- All chunks for the notebook fetched from Qdrant
- Content sorted by chunk index
- Assembled into complete research text
- Passed to audio generation crew
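The retrieval steps above can be sketched as a sort-and-join over the fetched chunks. This is a minimal illustration, not the code in audio_overview_agent.py: the `Chunk` class, its field names, and `assemble_research_text` are hypothetical, and the Qdrant fetch itself is left out.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    """Hypothetical stand-in for one stored chunk of a notebook."""
    chunk_index: int
    text: str


def assemble_research_text(chunks: list[Chunk]) -> str:
    """Order chunks by index and join them into one research text."""
    ordered = sorted(chunks, key=lambda c: c.chunk_index)
    # Skip empty chunks so the joined text has no blank gaps.
    return "\n\n".join(c.text.strip() for c in ordered if c.text.strip())


# Chunks may come back from the vector store in any order.
chunks = [Chunk(2, "Conclusion."), Chunk(0, "Introduction."), Chunk(1, "Methods.")]
research_text = assemble_research_text(chunks)
```

Sorting by chunk index restores the original reading order before the text is handed to the audio generation crew.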
backend/agents/audio_overview_agent.py:107-115
Research Analysis
A Research Analyst agent extracts and organizes key insights.
Analysis Tasks:
- Identifies main themes and key insights
- Highlights important supporting details
- Maintains factual accuracy
- Organizes points logically for discussion
backend/agents/audio_overview_agent.py:37-52
Conversation Planning
A Podcast Producer agent structures insights into a conversation outline.
Planning Process:
- Designs 4-5 minute conversation flow
- Creates natural transitions between topics
- Balances thoroughness with brevity
- Plans for Michael (host) and Sarah (expert)
backend/agents/audio_overview_agent.py:54-71
Script Writing
A Scriptwriter agent crafts natural podcast dialogue.
Script Requirements:
- Opens with “The DecipherIt Podcast” welcome
- 800-1000 words (4-5 minutes)
- Casual, natural dialogue
- Authentic reactions and interjections
- Meaningful back-and-forth discussion
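The length requirement implies a speaking rate of roughly 200 words per minute (800 words over 4 minutes, 1000 words over 5). The sketch below checks a draft script against the word range and the opening welcome. The function names, the 200-character window for the welcome, and the speaking-rate constant are illustrative assumptions, not part of the DecipherIt codebase.

```python
OPENING_PHRASE = "The DecipherIt Podcast"
MIN_WORDS, MAX_WORDS = 800, 1000
# Assumption: rate implied by "800-1000 words (4-5 minutes)".
WORDS_PER_MINUTE = 200


def estimate_minutes(script: str) -> float:
    """Estimate spoken duration from the word count."""
    return len(script.split()) / WORDS_PER_MINUTE


def check_script(script: str) -> list[str]:
    """Return a list of problems; an empty list means the script passes."""
    problems = []
    word_count = len(script.split())
    if not MIN_WORDS <= word_count <= MAX_WORDS:
        problems.append(f"word count {word_count} outside {MIN_WORDS}-{MAX_WORDS}")
    # Only look near the start, since the welcome should open the episode.
    if OPENING_PHRASE not in script[:200]:
        problems.append("missing podcast welcome near the start")
    return problems
```

A check like this could run before text-to-speech so that an overly long or short script is caught before any audio is generated.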
backend/agents/audio_overview_agent.py:73-97
Text-to-Speech Conversion
LemonFox AI converts the script to audio with different voices.
TTS Process:
- Michael: “liam” voice (host)
- Sarah: “jessica” voice (expert)
- Segments processed concurrently (max 5 at once)
- 0.5 second pause between segments
- Combined into single MP3 file
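The speaker-to-voice assignment above can be sketched as a small lookup that turns script segments into per-segment TTS requests. The voice names come from this page; the request dictionary keys (`index`, `voice`, `input`) and the function names are hypothetical and do not describe the LemonFox AI request format.

```python
# Speaker-to-voice mapping as described in the TTS process above.
VOICES = {"Michael": "liam", "Sarah": "jessica"}


def voice_for(speaker: str) -> str:
    """Look up the TTS voice for a speaker, rejecting unknown names."""
    try:
        return VOICES[speaker]
    except KeyError:
        raise ValueError(f"no voice configured for speaker {speaker!r}") from None


def build_tts_requests(segments: list[tuple[str, str]]) -> list[dict]:
    """Turn (speaker, text) pairs into ordered, hypothetical request payloads."""
    return [
        {"index": i, "voice": voice_for(speaker), "input": text}
        for i, (speaker, text) in enumerate(segments)
    ]


requests_plan = build_tts_requests([
    ("Michael", "Welcome to The DecipherIt Podcast!"),
    ("Sarah", "Thanks, glad to be here."),
])
```

Keeping an explicit index on each request makes it possible to reassemble the audio in script order even when segments finish out of order.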
backend/services/tts_service.py:84-191
Generating Audio Overviews
First Generation
- Navigate to your processed notebook
- Click the Audio Overview tab
- Click Generate Audio Overview button
- Wait 2-4 minutes for processing
- Audio player appears when ready
The page automatically polls for completion, so you don’t need to refresh.
CrewAI Audio Generation Workflow
Agent Configuration
backend/agents/audio_overview_agent.py:7-34
Script Output Format
backend/models/audio_overview_models.py
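The model definitions are not reproduced on this page. As a rough sketch of what a structured script could look like, the example below uses plain dataclasses with a list of speaker-tagged segments. The class names, field names, and JSON shape are assumptions for illustration; the actual models in audio_overview_models.py may differ.

```python
import json
from dataclasses import dataclass

SPEAKERS = ("Michael", "Sarah")


@dataclass(frozen=True)
class ScriptSegment:
    """One line of dialogue (hypothetical shape)."""
    speaker: str
    text: str

    def __post_init__(self):
        if self.speaker not in SPEAKERS:
            raise ValueError(f"unknown speaker: {self.speaker!r}")
        if not self.text.strip():
            raise ValueError("segment text must not be empty")


@dataclass(frozen=True)
class PodcastScript:
    """Ordered dialogue segments for one audio overview (hypothetical shape)."""
    segments: tuple[ScriptSegment, ...]

    @classmethod
    def from_json(cls, raw: str) -> "PodcastScript":
        data = json.loads(raw)
        return cls(tuple(ScriptSegment(**item) for item in data["segments"]))


raw = '{"segments": [{"speaker": "Michael", "text": "Welcome!"}, {"speaker": "Sarah", "text": "Hi."}]}'
script = PodcastScript.from_json(raw)
```

Validating speakers at construction time means a malformed script fails early, before any text-to-speech calls are made.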
Text-to-Speech Implementation
TTS Service Architecture
backend/services/tts_service.py:16-42
Concurrent Generation
backend/services/tts_service.py:84-133
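One common way to cap concurrent work at 5, as described for segment processing, is an `asyncio.Semaphore`. The sketch below assumes that approach; the referenced lines in tts_service.py are not shown here, so this is not necessarily how the service does it. `fake_synthesize` is a placeholder for the real TTS request.

```python
import asyncio

MAX_CONCURRENT = 5  # limit stated on this page


async def fake_synthesize(text: str) -> bytes:
    """Placeholder for a real TTS call; returns the text as bytes."""
    await asyncio.sleep(0.01)
    return text.encode("utf-8")


async def synthesize_all(texts, synthesize=fake_synthesize, limit=MAX_CONCURRENT):
    """Synthesize every segment, running at most `limit` calls at once."""
    semaphore = asyncio.Semaphore(limit)

    async def worker(text):
        async with semaphore:
            return await synthesize(text)

    # gather returns results in input order, whatever the completion order.
    return await asyncio.gather(*(worker(t) for t in texts))


audio_parts = asyncio.run(synthesize_all([f"segment {i}" for i in range(12)]))
```

Because `asyncio.gather` preserves input order, the results can be combined directly without re-sorting.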
Audio Combination
backend/services/tts_service.py:151-185
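The combination step joins segments with a 0.5 second pause between them. The sketch below shows that logic on raw 16-bit PCM bytes using only the standard library. This is a simplification: the service produces an MP3 file, and the decoding and encoding steps are out of scope here. The sample rate, sample width, and function names are assumptions.

```python
PAUSE_SECONDS = 0.5  # pause length stated on this page


def silence(seconds: float, sample_rate: int = 24000,
            sample_width: int = 2, channels: int = 1) -> bytes:
    """Return zero-valued signed PCM samples, which play back as silence."""
    frames = int(seconds * sample_rate)
    return b"\x00" * (frames * sample_width * channels)


def combine_segments(segments: list[bytes], pause: bytes) -> bytes:
    """Join segments with a pause between them, none before or after."""
    return pause.join(segments)


pause = silence(PAUSE_SECONDS)
combined = combine_segments([b"\x01\x00" * 100, b"\x02\x00" * 100], pause)
```

Using `bytes.join` places the pause only between consecutive segments, so the finished audio does not start or end with dead air.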
UI Implementation
Status Management
The audio overview component tracks generation status:
client/components/notebook/audio-overview-section.tsx:19-22
Polling for Completion
client/components/notebook/audio-overview-section.tsx:25-68
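The actual polling lives in a React component. The sketch below shows the same idea in Python, for consistency with the other examples on this page: check the status at a fixed interval (3 seconds, as noted under Performance Optimizations) until a terminal state is reached. The status names, the timeout, and the `fetch_status` callable are hypothetical.

```python
import time
from typing import Callable

POLL_INTERVAL_SECONDS = 3.0
TERMINAL_STATES = {"completed", "failed"}  # hypothetical status names


def poll_until_done(fetch_status: Callable[[], str],
                    interval: float = POLL_INTERVAL_SECONDS,
                    timeout: float = 600.0,
                    sleep: Callable[[float], None] = time.sleep) -> str:
    """Call fetch_status every `interval` seconds until it reports a terminal state."""
    waited = 0.0
    while True:
        status = fetch_status()
        if status in TERMINAL_STATES:
            return status
        if waited >= timeout:
            raise TimeoutError(f"still {status!r} after {waited:.0f} seconds")
        sleep(interval)
        waited += interval
```

Passing `sleep` as a parameter keeps the loop testable without real waiting. In the browser the equivalent would typically be a timer that is cleared once a terminal status arrives.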
Audio Quality Features
Natural Voices
High-quality AI voices (Liam and Jessica) create authentic-sounding podcast conversations.
Conversational Flow
Script includes natural reactions, interjections, and back-and-forth discussion for engagement.
Optimized Length
4-5 minute duration balances comprehensiveness with listenability.
Professional Production
Automatic pausing between segments and smooth transitions create polished output.
Use Cases
On-the-Go Learning
Listen to research summaries while:
- Commuting
- Exercising
- Doing chores
- Taking breaks
Accessibility
Audio format provides:
- Alternative to reading long summaries
- Accessibility for visual impairments
- Multi-modal learning options
- Reduced screen time
Content Review
Use audio for:
- Quick refreshers on research
- Pre-presentation review
- Sharing insights with colleagues
- Multi-tasking while learning
Performance Optimizations
Concurrent TTS
Up to 5 segments processed simultaneously for faster generation.
HTTP/2 Support
Connection pooling and HTTP/2 reduce API call overhead.
Memory Efficiency
Context managers ensure proper cleanup of audio buffers during processing.
Progressive Updates
UI polls every 3 seconds to show progress without blocking.
Technical Details
Connection Pooling
backend/services/tts_service.py:44-60
Rate Limiting
backend/services/tts_service.py:42, 139-149
Limitations
Best Practices
Related Features
AI Summaries
Read the full written summary
Interactive Q&A
Ask questions about specific details
Mindmaps
Visual representation of research structure