How it works
The chat feature provides a direct text interface to Cluely’s language models, supporting multiple AI providers:- Google Gemini (default) - Fast, capable vision and language model
- Ollama - Local, privacy-focused models
- OpenRouter - Access to various cloud models
- K2 Think - Specialized reasoning models
Starting a chat
Access the chat interface through the Solutions view after processing screenshots or voice input. The chat maintains context from your current session:Chat with context
The AI assistant uses the same system prompt across all features:All responses are formatted in Markdown with support for LaTeX equations, code blocks, and structured formatting.
Response formatting
Code blocks
Code in responses is automatically syntax-highlighted:Mathematical equations
Math is rendered using LaTeX:- Inline:
$E=mc^2$→ - Block:
$$x = \frac{-b \pm \sqrt{b^2-4ac}}{2a}$$
Structured information
Responses use headers, lists, and bold text for clarity:AI provider management
Checking current provider
See which AI provider is currently active:Switching providers
- Gemini
- Ollama
- OpenRouter
- K2 Think
Testing connection
Verify your AI provider is working:Local models with Ollama
Setup
- Install Ollama from ollama.ai
- Pull a model:
ollama pull llama3.2 - Set environment variables:
Auto-detection
Cluely automatically detects available Ollama models:Benefits
Privacy
All processing happens locally on your machine
No API costs
Use unlimited tokens without spending money
Offline support
Works without internet connection
Model choice
Choose from dozens of open-source models
Error handling
Rate limiting
Cluely automatically handles rate limits with exponential backoff:Configure a fallback API key with
GEMINI_FALLBACK_API_KEY to automatically switch when rate limits are hit.Model overload
When models are overloaded (503 errors), Cluely retries with exponential backoff up to 3 times.Best practices
Write clear questions
Write clear questions
Be specific about what you need:
- ❌ “Fix this code”
- ✅ “Why does this function return undefined instead of the sum?”
Provide context
Provide context
Share relevant details:
- Language/framework you’re using
- What you’ve already tried
- Error messages you’re seeing
Break down complex problems
Break down complex problems
For multi-step problems:
- Ask one question at a time
- Build on previous responses
- Verify understanding before moving forward
Choose the right provider
Choose the right provider
- Gemini: Best for vision + chat, fast responses
- Ollama: Best for privacy, offline work
- OpenRouter: Best for accessing specific models
- K2 Think: Best for complex reasoning tasks
Environment configuration
Set these in your.env file: