Overview
GitHub Models offers rate-limited access to various state-of-the-art models:- Free access with your GitHub account
- Multiple model providers (OpenAI, Meta, DeepSeek, and more)
- Rate-limited but generous for development use
- No separate subscription needed if you use GitHub
GitHub Models is free but rate-limited. For high-volume usage, consider direct API providers.
Available Models
GitHub provides access to various models:OpenAI Models
- gpt-4o ⭐ (default) - OpenAI’s flagship model
Meta Llama Models
- Llama-3.3-70B-Instruct - Powerful open model
DeepSeek Models
- DeepSeek-R1 - Reasoning-focused model
- Deepseek-V3 - Advanced DeepSeek model
Other Models
- Phi-4-mini-instruct - Microsoft’s efficient model
- Codestral 25.01 - Mistral’s code model
- Mistral Large 24.11 - Mistral’s largest model
Browse all available models at github.com/marketplace?type=models
Setup
1. Create a GitHub Personal Access Token (PAT)
You need a GitHub PAT to authenticate:Go to GitHub Settings
Configure Token
- Give it a descriptive name (e.g., “GitWhisper”)
- Set expiration (recommended: 90 days or No expiration)
- Select scopes:
repo(if private repos) or no scopes (for public use)
2. Configure GitWhisper
Save your GitHub PAT:- Save Permanently
- Environment Variable
- Command Line
~/.git_whisper.yamlUsage
Basic Usage
Use GitHub Models with default model (gpt-4o):Specific Model
Choose a specific model:Set as Default
Make GitHub Models your default:Rate Limits
GitHub Models has rate limits that vary by model and your GitHub account type:- Free Tier
- GitHub Pro
- Enterprise
Typical limits for free GitHub accounts:
- Requests per minute: 10-15
- Requests per hour: 50-150
- Tokens per request: Varies by model
- Individual development
- Small projects
- Testing and learning
Model Comparison
- By Quality
- By Use Case
| Model | Quality | Speed | Best For |
|---|---|---|---|
| gpt-4o | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ | General purpose |
| Llama-3.3-70B | ⭐⭐⭐⭐ | ⭐⭐⭐ | Open source preference |
| DeepSeek-R1 | ⭐⭐⭐⭐ | ⭐⭐⭐ | Reasoning tasks |
| Codestral | ⭐⭐⭐⭐ | ⭐⭐⭐⭐ | Code-focused |
| Phi-4-mini | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ | Fast responses |
Code Analysis
Use GitHub Models for code analysis:Advantages
Free Access
No separate API subscription needed if you use GitHub
Multiple Models
Access various providers through one API
Easy Setup
Use your existing GitHub account
Regular Updates
GitHub adds new models regularly
Limitations
Rate Limits
Limited requests per hour - may not suit high-volume use
Model Availability
Some models may have waiting lists or limited access
No Guaranteed SLA
Free tier has no service level guarantees
Account Required
Need GitHub account and PAT
Troubleshooting
Invalid Token
Invalid Token
- Check token hasn’t expired
- Ensure token has correct scopes
- Regenerate if necessary:
Rate Limit Exceeded
Rate Limit Exceeded
- Wait for limit reset (usually hourly)
- Use a different model temporarily:
- Upgrade to GitHub Pro for higher limits
Model Not Available
Model Not Available
Access Denied
Access Denied
- Check if model requires waitlist approval
- Verify your GitHub account status
- Try a different model
Comparison with Other Options
- vs Free Model
- vs Direct APIs
- vs Ollama
| Feature | GitHub Models | Free Model |
|---|---|---|
| Setup | PAT required | None |
| Quality | Excellent | Good |
| Rate Limits | Generous | 60/hour |
| Model Choice | Multiple | Single |
| Requirements | GitHub account | None |
Best Practices
- Use appropriate models: Choose models based on your task
- Monitor rate limits: Keep track of your usage
- Secure your PAT: Never commit tokens to repositories
- Rotate tokens: Regularly update your PAT
- Have alternatives: Know how to switch models if limits hit
Example Workflow
Next Steps
View All Models
Browse GitHub Models marketplace
GitHub Docs
Official GitHub Models documentation
Compare Models
See all GitWhisper model options
API Keys
Learn about API key management