Available Models
Anthropic Claude on Bedrock
Claude 4 Series (Latest)
- anthropic.claude-opus-4-6-v1:0 - Most capable (200K context)
- anthropic.claude-opus-4-1-20250805-v1:0 - Enhanced flagship (200K)
- anthropic.claude-sonnet-4-5-20250929-v1:0 - Balanced (200K)
- anthropic.claude-sonnet-4-20250514-v1:0 - Efficient (200K)
- anthropic.claude-haiku-4-5-20251001-v1:0 - Fast (200K)
Claude 3 Series
- anthropic.claude-3-7-sonnet-20250219-v1:0 - Enhanced (200K)
- anthropic.claude-3-5-sonnet-20241022-v2:0 - Intelligent (200K)
- anthropic.claude-3-5-haiku-20241022-v1:0 - Fast (200K)
- anthropic.claude-3-opus-20240229-v1:0 - Powerful (200K)
Amazon Nova (Latest)
- us.amazon.nova-premier-v1:0 - Premier multimodal (300K)
- us.amazon.nova-pro-v1:0 - High-capability (300K)
- us.amazon.nova-lite-v1:0 - Fast and cost-effective (300K)
- us.amazon.nova-micro-v1:0 - Ultra-fast text-only (128K)
Meta Llama
- meta.llama3-1-405b-instruct-v1:0 - Largest open model (128K)
- meta.llama3-1-70b-instruct-v1:0 - Powerful (128K)
- meta.llama3-2-90b-instruct-v1:0 - Multimodal (128K)
- meta.llama3-2-11b-instruct-v1:0 - Multimodal (128K)
Other Providers
- Cohere Command R/R+ - RAG-optimized (128K)
- Mistral Large/Small - European AI (32K)
- Amazon Titan - Amazon’s models (8K-300K)
Prerequisites
Before using Bedrock with Forge:- AWS Account: Active account with billing enabled
- Model Access: Request access to models in Bedrock console
- Bedrock Access Gateway: Deploy gateway infrastructure
- API Gateway: CloudFormation stack with API key
Setup Steps
Step 1: Deploy Bedrock Access Gateway
AWS Bedrock doesn’t provide a simple REST API. You need to deploy the Bedrock Access Gateway:Create API Key in Secrets Manager
- Go to AWS Secrets Manager
- Create a new secret
- Choose “Other type of secret”
- Add key-value pair:
apiKey= your chosen API key - Name it (e.g.,
bedrock-api-key) - Note the ARN for next step
Deploy CloudFormation Stack
Follow the instructions in the gateway repository to:
- Package the Lambda function
- Deploy the CloudFormation template
- Pass your Secrets Manager ARN
- Wait for stack creation to complete
Get API Base URL
After deployment completes:
- Go to CloudFormation Console
- Select your stack
- Go to “Outputs” tab
- Copy the
ApiUrlvalue (your API base URL)
Step 2: Request Model Access
Open Bedrock Console
Navigate to AWS Bedrock in your region.
Step 3: Configure Forge
Select OpenAI-Compatible Provider
Since Bedrock Gateway provides OpenAI-compatible API:
- Select “OpenAI-Compatible” from the list
- Enter your API base URL from Step 1
- Enter the API key from Secrets Manager
Configure Model
Set your default model in Note: Use the full Bedrock model ID (with vendor prefix)
forge.yaml:Configuration
API Endpoint
Your Bedrock Access Gateway URL format:Model IDs
Use the full Bedrock model identifier:anthropic.claude-sonnet-4-20250514-v1:0us.amazon.nova-pro-v1:0meta.llama3-1-70b-instruct-v1:0
Model Selection
For Coding and Agents
Best Performance:anthropic.claude-opus-4-6-v1:0- Most capableanthropic.claude-sonnet-4-5-20250929-v1:0- Balanced
anthropic.claude-haiku-4-5-20251001-v1:0- Fast and cheapus.amazon.nova-lite-v1:0- Cost-effective
For Maximum Context
- 300K: Amazon Nova models
- 200K: Claude 3 and 4 models
- 128K: Llama 3.1/3.2, Cohere Command R
For Multimodal Tasks
- Claude 4 - Excellent vision capabilities
- Amazon Nova Pro/Premier - High-quality multimodal
- Llama 3.2 90B/11B - Open-source multimodal
Switching Models
Regional Deployment
Bedrock is available in multiple AWS regions: US Regions:us-east-1(N. Virginia) - Most modelsus-west-2(Oregon) - Most models
eu-west-1(Ireland)eu-central-1(Frankfurt)
ap-southeast-1(Singapore)ap-northeast-1(Tokyo)
Features
Enterprise Security
- VPC Endpoints: Private connectivity
- IAM Policies: Fine-grained access control
- CloudTrail: Full audit logging
- KMS Encryption: Customer-managed keys
Compliance
- SOC 2
- HIPAA
- GDPR
- PCI DSS
- FedRAMP (select regions)
AWS Integration
- CloudWatch: Monitoring and metrics
- S3: Data storage and retrieval
- Lambda: Serverless integration
- SageMaker: ML pipeline integration
Best Practices
Security
Access Control:- Use IAM roles with least privilege
- Rotate API keys regularly
- Enable CloudTrail logging
- Use VPC endpoints for private access
Cost Optimization
Model Selection:- Use Haiku/Nova Lite for simple tasks
- Use Sonnet for balanced workloads
- Reserve Opus/Premier for complex tasks
- Use Lambda provisioned concurrency if needed
- Monitor API Gateway costs
- Set up billing alerts
- Minimize context size
- Cache common prompts
- Use streaming for long responses
Rate Limits
Bedrock has:- Requests per minute: Varies by model and region
- Tokens per minute: Varies by model
- Concurrent requests: Service quotas apply
Troubleshooting
Gateway Deployment Fails
If CloudFormation deployment fails:- Check you have sufficient IAM permissions
- Verify Secrets Manager ARN is correct
- Check CloudFormation events for specific errors
- Ensure Lambda execution role has Bedrock access
Model Access Denied
If you can’t access a model:- Verify model access was granted in Bedrock console
- Check you’re using the correct model ID
- Ensure your region supports the model
- Wait a few minutes after granting access
API Key Authentication Failed
If authentication fails:- Verify API key matches Secrets Manager
- Check API Gateway is deployed correctly
- Test the gateway URL directly
- Review CloudWatch logs for errors
Quota Exceeded
If you hit limits:- Check current quotas in Service Quotas Console
- Request quota increase
- Implement exponential backoff
- Distribute load across regions
Region Not Supported
If model isn’t available:- Check model availability by region
- Switch to a supported region
- Redeploy gateway in new region
- Update Forge configuration
Deprecated: Environment Variable Setup
For backward compatibility with the gateway:Cost Comparison
Bedrock pricing varies by model: Claude Models:- Similar pricing to direct Anthropic API
- No additional Bedrock fees
- API Gateway costs apply ($3.50/million requests)
- Lambda costs apply (minimal for this use case)
- Competitive pricing
- No API Gateway fees when using direct SDK
- Open-source models, lower cost
- Pay only for infrastructure
Next Steps
- Review Bedrock documentation
- Set up CloudWatch monitoring
- Configure VPC endpoints for production
- Explore Bedrock Agents for advanced workflows