H2O-3 requires Java 8 or later on the machine where the cluster runs. See Installation for full prerequisites.
Train a GBM model
Initialize the H2O cluster
h2o.init() starts a local H2O-3 server if one is not already running, then connects to it. With no arguments, it starts on localhost:54321 and uses all available CPU cores.Load data
Import the iris dataset from the H2O public test data bucket.
h2o.import_file reads data in parallel and distributes it across the cluster.Split into training and test sets
Use
split_frame to create an 80/20 train-test split with a fixed seed for reproducibility.Train a GBM model
Train a Gradient Boosting Machine for multiclass classification on the
class column.AutoML quickstart
H2O AutoML automatically trains and tunes many models — including GBMs, XGBoost, Random Forests, Deep Learning, GLMs, and Stacked Ensembles — and ranks them on a leaderboard. Use it when you want the best model without manually tuning hyperparameters.Run AutoML
Set
max_models to control how many models to train. AutoML builds base models and stacked ensembles, then ranks them by AUC on the leaderboard.AutoML stops when
max_models is reached or when the optional max_runtime_secs wall-clock limit expires, whichever comes first.Next steps
Installation
Detailed install instructions for Python, R, conda, and the standalone jar.
Introduction
Architecture overview, supported algorithms, and multi-language API.
Algorithm reference
Deep dive into every algorithm, its parameters, and when to use it.
AutoML
Customize AutoML: exclude algorithms, set stopping criteria, add preprocessing.