Features
- Latest AI Technology: Gemini 2.5 Flash with advanced reasoning
- Native Vision: Direct image and screenshot analysis
- Audio Processing: Built-in transcription and audio analysis
- Fast Responses: Cloud-optimized for minimal latency
- Automatic Fallback: Switch to backup API key on rate limits
Setup
Get API Key
Visit Google AI Studio and create a new API key.
You can create multiple API keys for fallback protection against rate limits.
Configuration Options
Environment Variables
| Variable | Required | Description | Default |
|---|---|---|---|
GEMINI_API_KEY | Yes* | Primary API key | - |
GEMINI_FALLBACK_API_KEY | No | Backup API key for rate limits | - |
*Required unless using Ollama, OpenRouter, or K2 Think V2
Default Model
Cluely usesmodels/gemini-2.5-flash by default for optimal balance of speed and quality.
Switching to Gemini at Runtime
Features in Detail
Automatic Fallback
When rate limits are hit, Cluely automatically switches to your fallback API key:Set up a fallback key to avoid interruptions during high usage periods.
Error Handling
Cluely includes retry logic with exponential backoff for transient errors:- Rate Limits
- Service Overload
Automatically retries with fallback API key when available.Detected error patterns:
- HTTP 429 status
- “quota” in error message
- “RATE_LIMIT” or “RESOURCE_EXHAUSTED” errors
Vision Capabilities
Gemini provides native image analysis for screenshots:- Screenshot analysis for coding problems
- Error message interpretation
- Document and presentation parsing
- Multi-image context understanding
Audio Processing
Gemini handles audio transcription and analysis:- MP3 audio files
- Base64-encoded audio streams
Best Practices
API Key Management
API Key Management
- Use separate API keys for development and production
- Set up fallback keys to handle rate limits
- Monitor usage in Google AI Studio
- Rotate keys periodically for security
Rate Limit Optimization
Rate Limit Optimization
- Configure
GEMINI_FALLBACK_API_KEYfor automatic failover - Monitor console logs for rate limit warnings
- Consider Ollama for high-volume local usage
Error Monitoring
Error Monitoring
Watch for these console messages:
Voice Features
Voice features always use Gemini, even when other providers are configured for text chat.
Troubleshooting
API Key Not Found
API Key Not Found
Error:
Either provide Gemini API key, enable Ollama mode, enable K2 Think, or provide OpenRouter API keySolution:- Verify
.envfile exists in project root - Check
GEMINI_API_KEYis set correctly - Restart the application after updating
.env
Rate Limit Errors
Rate Limit Errors
Error:
429 or RATE_LIMIT_EXCEEDEDSolution:- Set up
GEMINI_FALLBACK_API_KEYin.env - Wait for quota to reset (usually within minutes)
- Consider upgrading to higher tier in Google AI Studio
Empty Responses
Empty Responses
Error: Voice or image analysis returns empty resultsSolution:
- Check API key has sufficient quota
- Verify image format is supported (PNG recommended)
- Ensure audio is in MP3 format
- Check console for detailed error messages
Next Steps
Ollama Setup
Configure local AI for privacy-first usage
OpenRouter Setup
Access multiple AI models through one API