Google Cloud Vertex AI offers free preview access to select Meta Llama models with generous rate limits.
Overview
Google Cloud Vertex AI provides access to enterprise-grade AI models through Google Cloud Platform. Select Llama models are available for free during their preview period.Rate Limits
All listed models are free during preview period.
| Model Name | Requests/Minute | Pricing |
|---|---|---|
| Llama 3.2 90B Vision Instruct | 30 | Free during preview |
| Llama 3.1 70B Instruct | 60 | Free during preview |
| Llama 3.1 8B Instruct | 60 | Free during preview |
Available Models
Llama 3.2 90B Vision
Multimodal model with vision capabilities
Llama 3.1 70B
Powerful 70B parameter model
Llama 3.1 8B
Efficient 8B parameter model
API Usage
Getting Started
Create Google Cloud Account
Sign up at cloud.google.com
Access Model Garden
Visit the Model Garden
Key Features
Enterprise Grade
Production-ready infrastructure
Free Preview
No cost during preview period
High Rate Limits
Up to 60 requests per minute
Vision Support
Multimodal capabilities with Llama 3.2
Global Infrastructure
Google’s worldwide network
Security
Enterprise security and compliance
Model Details
Llama 3.2 90B Vision Instruct
- Capabilities: Text and image understanding
- Rate Limit: 30 requests/minute
- Best For: Multimodal applications, image analysis
Llama 3.1 70B Instruct
- Capabilities: Advanced text generation and reasoning
- Rate Limit: 60 requests/minute
- Best For: Complex tasks, long-form content
Llama 3.1 8B Instruct
- Capabilities: Efficient text generation
- Rate Limit: 60 requests/minute
- Best For: Fast inference, cost-effective applications
Important Considerations
Use Cases
- Enterprise Applications: Build production-grade AI apps
- Multimodal Projects: Leverage vision capabilities
- Prototyping: Test Llama models at scale
- Migration: Evaluate before committing to paid tier
Additional Resources
Vertex AI Console
Access the platform
Model Garden
Browse available models
Documentation
Official documentation
Llama Models
Meta Llama on Vertex AI
