Quick Start
Get h2oGPT running in minutes with pip or Docker
Docker Install
Recommended for full capabilities on Linux, Windows, and macOS
Document Q&A
Chat with PDFs, Word docs, spreadsheets, images, and more
API Reference
OpenAI-compatible REST API for chat, embeddings, audio, and images
Key capabilities
Chat UI
Gradio-based UI with streaming, multi-model bake-off, and saved chats
Vision & Images
Understand images with LLaVA and generate images with Stable Diffusion
Voice STT / TTS
Whisper speech-to-text and Microsoft SpeechT5 text-to-speech
Agents
Autonomous agents for web search, Python code, CSV analysis, and more
Model Backends
oLLaMa, vLLM, llama.cpp, GPT4All, HF TGI, and many more
Fine-tuning
Fine-tune models with LoRA on your own data
Get started in three steps
Install h2oGPT
The fastest path is pip install. For full capabilities including vision, audio, and image generation, use Docker.
Launch the server
Run
generate.py pointing at any supported model. h2oGPT starts a Gradio UI and an OpenAI-compatible API server automatically.h2oGPT is 100% private — all inference runs locally. No data is sent to external servers unless you explicitly configure a remote inference backend.