Google Gemini is Cluely’s default AI provider, offering current-generation models with native vision capabilities and fast response times.

Features

  • Latest AI Technology: Gemini 2.5 Flash with advanced reasoning
  • Native Vision: Direct image and screenshot analysis
  • Audio Processing: Built-in transcription and audio analysis
  • Fast Responses: Cloud-optimized for minimal latency
  • Automatic Fallback: Switch to backup API key on rate limits

Setup

1. Get API Key

Visit Google AI Studio and create a new API key.
You can create multiple API keys for fallback protection against rate limits.
2. Configure Environment

Add your API key(s) to the .env file in the project root:
GEMINI_API_KEY=your_primary_api_key_here
GEMINI_FALLBACK_API_KEY=your_backup_api_key_here
Keep your API keys secure. Never commit .env files to version control.
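Once the app has loaded `.env`, the key lookup amounts to reading these two variables; the sketch below is illustrative (`resolveGeminiKeys` is a hypothetical helper, not part of Cluely's API):

```typescript
// Hypothetical helper: resolve the primary and fallback keys from the
// environment after .env has been loaded. Missing keys come back undefined.
function resolveGeminiKeys(env: Record<string, string | undefined>) {
  return {
    primary: env.GEMINI_API_KEY,
    fallback: env.GEMINI_FALLBACK_API_KEY,
  };
}

// Example: reading from process.env at startup
const keys = resolveGeminiKeys(process.env);
if (!keys.primary) {
  console.warn("GEMINI_API_KEY is not set; Gemini features will be unavailable.");
}
```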
3. Verify Configuration

Start Cluely and check the console for:
[LLMHelper] Using Google Gemini

Configuration Options

Environment Variables

| Variable | Required | Description | Default |
| --- | --- | --- | --- |
| GEMINI_API_KEY | Yes* | Primary API key | - |
| GEMINI_FALLBACK_API_KEY | No | Backup API key for rate limits | - |

*Required unless using Ollama, OpenRouter, or K2 Think V2

Default Model

Cluely uses models/gemini-2.5-flash by default for optimal balance of speed and quality.
// Default configuration (source/electron/LLMHelper.ts:26)
private geminiModel: string = "models/gemini-2.5-flash"

Switching to Gemini at Runtime

// Switch from another provider to Gemini
await llmHelper.switchToGemini(apiKey, model)

// Or use default configuration
await llmHelper.switchToGemini()

Features in Detail

Automatic Fallback

When rate limits are hit, Cluely automatically switches to your fallback API key:
// Automatic fallback logic (source/electron/LLMHelper.ts:139-145)
if (isRateLimitError && !this.usingFallbackKey && this.fallbackGeminiApiKey) {
  console.log(`[LLMHelper] Rate limit hit, switching to fallback API key...`);
  this.geminiApiKey = this.fallbackGeminiApiKey;
  this.model = new GoogleGenAI({ apiKey: this.fallbackGeminiApiKey });
  this.usingFallbackKey = true;
}
Set up a fallback key to avoid interruptions during high usage periods.

Error Handling

Cluely includes retry logic with exponential backoff for transient errors:
Cluely automatically retries with the fallback API key when available. Detected error patterns:
  • HTTP 429 status
  • “quota” in error message
  • “RATE_LIMIT” or “RESOURCE_EXHAUSTED” errors
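The detection and backoff described above can be sketched as follows; the function names and delay schedule are illustrative assumptions, not Cluely's exact implementation:

```typescript
// Illustrative sketch: classify an error using the patterns listed above.
function isRateLimitError(err: { status?: number; message?: string }): boolean {
  const msg = (err.message ?? "").toUpperCase();
  return (
    err.status === 429 ||
    msg.includes("QUOTA") ||
    msg.includes("RATE_LIMIT") ||
    msg.includes("RESOURCE_EXHAUSTED")
  );
}

// Illustrative retry loop with exponential backoff (1s, 2s, 4s, ...).
async function withRetry<T>(
  fn: () => Promise<T>,
  maxAttempts = 3,
  baseDelayMs = 1000
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      if (attempt === maxAttempts) break;
      const delayMs = baseDelayMs * 2 ** (attempt - 1);
      console.log(
        `[LLMHelper] Model overloaded, retrying in ${delayMs}ms... (attempt ${attempt}/${maxAttempts})`
      );
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
  throw lastError;
}
```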

Vision Capabilities

Gemini provides native image analysis for screenshots:
// Image analysis example
const imageParts = await Promise.all(
  imagePaths.map(path => this.fileToGenerativePart(path))
)

const result = await this.generateContentWithRetry([
  { parts: [{ text: prompt }, ...imageParts] }
])
Supported Features:
  • Screenshot analysis for coding problems
  • Error message interpretation
  • Document and presentation parsing
  • Multi-image context understanding
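The file-to-part conversion used in the snippet above can be sketched like this; the extension-to-MIME mapping and PNG default are assumptions for illustration, not Cluely's exact code:

```typescript
import { readFile } from "fs/promises";
import { extname } from "path";

// Sketch: read an image from disk and wrap it in the inlineData shape
// the Gemini API expects. The extension-to-MIME map is an assumption.
async function fileToGenerativePart(filePath: string) {
  const mimeTypes: Record<string, string> = {
    ".png": "image/png",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".webp": "image/webp",
  };
  const mimeType = mimeTypes[extname(filePath).toLowerCase()] ?? "image/png";
  const data = await readFile(filePath);
  return {
    inlineData: {
      data: data.toString("base64"),
      mimeType,
    },
  };
}
```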

Audio Processing

Gemini handles audio transcription and analysis:
// Audio analysis example
const audioPart = {
  inlineData: {
    data: audioData.toString("base64"),
    mimeType: "audio/mp3"
  }
}
Supported Formats:
  • MP3 audio files
  • Base64-encoded audio streams
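Putting the snippet above together with a file read, a complete MP3 part might be built like this (`audioFileToPart` is a hypothetical helper, not Cluely's API):

```typescript
import { readFile } from "fs/promises";

// Hypothetical sketch: load an MP3 from disk and build the inlineData
// part shown above, hard-coding audio/mp3 per the supported formats.
async function audioFileToPart(filePath: string) {
  const audioData = await readFile(filePath);
  return {
    inlineData: {
      data: audioData.toString("base64"),
      mimeType: "audio/mp3",
    },
  };
}
```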

Best Practices

  • Use separate API keys for development and production
  • Configure GEMINI_FALLBACK_API_KEY for automatic failover during rate limits
  • Monitor usage in Google AI Studio and watch console logs for rate limit warnings
  • Rotate keys periodically for security
  • Consider Ollama for high-volume local usage
Watch for these console messages:
# Success
[LLMHelper] Using Google Gemini

# Fallback activated
[LLMHelper] Rate limit hit, switching to fallback API key...

# Retry attempt
[LLMHelper] Model overloaded, retrying in 1000ms... (attempt 1/3)

Voice Features

Voice features always use Gemini, even when other providers are configured for text chat.
Cluely uses a dedicated Gemini client for voice operations:
// Separate client for voice (source/electron/LLMHelper.ts:36)
private geminiVoiceClient: GoogleGenAI | null = null
This ensures reliable voice transcription and response generation regardless of your primary AI provider.

Troubleshooting

Error: Either provide Gemini API key, enable Ollama mode, enable K2 Think, or provide OpenRouter API key
Solution:
  1. Verify .env file exists in project root
  2. Check GEMINI_API_KEY is set correctly
  3. Restart the application after updating .env
Error: 429 or RATE_LIMIT_EXCEEDED
Solution:
  1. Set up GEMINI_FALLBACK_API_KEY in .env
  2. Wait for quota to reset (usually within minutes)
  3. Consider upgrading to higher tier in Google AI Studio
Error: Voice or image analysis returns empty results
Solution:
  1. Check API key has sufficient quota
  2. Verify image format is supported (PNG recommended)
  3. Ensure audio is in MP3 format
  4. Check console for detailed error messages

Next Steps

Ollama Setup

Configure local AI for privacy-first usage

OpenRouter Setup

Access multiple AI models through one API
