Troubleshooting

This guide helps you diagnose and resolve common issues when working with the SKU Semantic Search application.

Database issues

Connection refused or database not available

Symptoms:

Application fails to start
Error: connection refused or could not connect to server

Solutions:

Verify Docker container is running:
```
docker ps
```
You should see a PostgreSQL container running. If not, start it:
```
docker-compose up -d db
```
Check DATABASE_URL configuration:
- Verify the connection string format: postgresql://user:pass@host:port/database
- Confirm the port matches your Docker configuration (default: 5435)
- Ensure credentials match your docker-compose.yml

Test database connectivity:

psql postgresql://user:password@localhost:5435/skudb

Check Docker logs:
```
docker-compose logs db
```

pgvector extension not found

Symptoms:

Error: type "vector" does not exist
Error: extension "vector" does not exist

Solutions:

Verify pgvector is installed: Connect to your database and run:
```
SELECT * FROM pg_extension WHERE extname = 'vector';
```
Create the extension: The application attempts to create it automatically in app/main.py, but you can create it manually:
```
CREATE EXTENSION IF NOT EXISTS vector;
```
Use the correct Docker image: Ensure your docker-compose.yml uses a PostgreSQL image with pgvector:
```
image: ankane/pgvector:latest
```

The extension must be created before running migrations or creating tables with vector columns.

Vector dimension mismatch

Symptoms:

Error: expected 3072 dimensions, not X

Solutions:The Gemini embedding model produces 3072-dimensional vectors. Ensure:

Your database schema defines vector columns as vector(3072)
You’re using the correct embedding model: gemini-embedding-001
You haven’t mixed embeddings from different models

If you need to change embedding models, you must:

Drop existing embeddings
Update the vector column dimension
Regenerate all embeddings

API key and authentication issues

Invalid or missing API keys

Symptoms:

Error: API key not valid
Error: 401 Unauthorized from AI providers
Application startup validation error

Solutions:

Verify API keys are set:

# Check if variables are loaded
python -c "from app.core.config import settings; print(settings.GEMINI_API_KEY[:10])"

Validate API keys:
- Gemini: Visit Google AI Studio and verify your key
- Anthropic: Visit Anthropic Console and check your key status
Check for whitespace or quotes: In your .env file, ensure no extra spaces or quotes:
```
# Wrong
GEMINI_API_KEY = " key_here "

# Correct
GEMINI_API_KEY=key_here
```
Regenerate keys if necessary: If keys are compromised or invalid, generate new ones from the respective consoles.

JWT token errors

Symptoms:

Error: Could not validate credentials
Error: Signature verification failed
Users unexpectedly logged out

Solutions:

Verify JWT_SECRET is set:

python -c "from app.core.config import settings; print(len(settings.JWT_SECRET))"

Should return a number > 16 (ideally 32+)

Check for secret rotation: If you changed JWT_SECRET, all existing tokens become invalid. Users must log in again.
Verify token expiration: Check ACCESS_TOKEN_EXPIRE_MINUTES setting. Tokens expire after this duration.

Test token generation:

from app.core.security import Security
token = Security.create_access_token({"sub": "[email protected]"})
print(token)

LLM and embedding issues

LLM failover not working

Symptoms:

Application crashes when Gemini fails
No automatic fallback to Claude
Error messages about unavailable services

Solutions:

Verify both API keys are configured:

python -c "from app.core.config import settings; print('Gemini:', bool(settings.GEMINI_API_KEY)); print('Anthropic:', bool(settings.ANTHROPIC_API_KEY))"

Check LLM service configuration: The failover hierarchy is defined in app/services/llm_service.py:13-26. Verify the configuration:
- Google models are tried first
- Anthropic models are fallback
Review error handling: The service catches APIStatusError and APIConnectionError. Other exceptions may not trigger failover.
Check application logs: Look for warning messages indicating failover attempts:
```
⚠️ Error de red/status en google (models/gemini-2.5-flash): ...
```

Failover only occurs during answer generation. Embeddings always use Gemini and do not have a fallback.

Embedding generation fails

Symptoms:

Error: Error al generar embedding
Search returns no results
Products cannot be indexed

Solutions:

Verify Gemini API key: Embeddings require a valid GEMINI_API_KEY. There is no fallback.
Check API quota: You may have exceeded your free tier quota. Check Google AI Studio usage.
Validate input text:
- Text should not be empty
- Very long text (>10,000 characters) may fail
- Special characters are handled, but extreme cases may cause issues

Test embedding generation directly:

from app.services.llm_service import LLMService
embedding = LLMService.get_embedding("test product")
print(f"Embedding dimension: {len(embedding)}")

Should output: Embedding dimension: 3072

Rate limiting or quota exceeded

Symptoms:

Error: 429 Too Many Requests
Error: Quota exceeded
Intermittent failures during high usage

Solutions:

Check quota usage:
- Gemini: Google AI Studio
- Anthropic: Console Usage
Implement caching: Cache embeddings and LLM responses to reduce API calls:
- Store embeddings in database (already done for products)
- Cache search results for common queries
- Use Redis or similar for response caching
Upgrade your plan: Consider upgrading to paid tiers for higher quotas.
Implement rate limiting: Add rate limiting to your API endpoints to control request volume.

Free tier quotas are typically very limited. Production deployments should use paid plans with appropriate quotas.

Development environment issues

Import errors or missing dependencies

Symptoms:

Error: ModuleNotFoundError: No module named 'X'
Import errors when starting the application

Solutions:

Verify virtual environment is activated:

which python
# Should point to your venv/bin/python

Install dependencies:
```
pip install -r requirements.txt
```
Check Python version: The project requires Python 3.13+:
```
python --version
```

Reinstall dependencies: If issues persist, recreate the virtual environment:

rm -rf venv
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

CORS errors in browser

Symptoms:

Error: Access to fetch at ... has been blocked by CORS policy
Frontend cannot connect to API

Solutions:

Verify CORS configuration: Check app/main.py:17-23 for CORS middleware settings.

Add your frontend origin: Update allow_origins to include your frontend URL:

allow_origins=[
    "http://localhost:4200",  # Angular default
    "http://localhost:3000",  # React default
    "http://localhost:5173",  # Vite default
],

For development, allow all origins:
```
allow_origins=["*"],
```

Never use allow_origins=["*"] in production. Specify exact origins for security.

Application won't start or crashes immediately

Symptoms:

Application exits immediately after starting
Validation errors on startup
Cryptic error messages

Solutions:

Check for configuration errors: Most startup failures are due to missing or invalid configuration:
```
python -c "from app.core.config import settings"
```
This will show validation errors clearly.
Verify database connection: Test the database before starting the app:
```
psql $DATABASE_URL -c "SELECT 1;"
```

Check for port conflicts: If port 8000 is in use:

lsof -i :8000
# Kill the process or use a different port
uvicorn app.main:app --port 8001

Review application logs: Run with verbose logging:

uvicorn app.main:app --reload --log-level debug

Getting help

If you encounter issues not covered here:

Check application logs: Most errors include detailed stack traces
Enable debug mode: Set --log-level debug when running uvicorn
Verify configuration: Use the configuration validation script
Test components individually: Isolate the failing component (database, LLM, auth)

When reporting issues, include:

Error message and full stack trace
Configuration (with secrets redacted)
Steps to reproduce
Python and dependency versions

Resources

Database issues

API key and authentication issues

LLM and embedding issues

Development environment issues

Getting help

Build docs developers (and LLMs) love

Resources

​Database issues

​API key and authentication issues

​LLM and embedding issues

​Development environment issues

​Getting help

Build docs developers (and LLMs) love

Database issues

API key and authentication issues

LLM and embedding issues

Development environment issues

Getting help