## Supported Algorithms
| Algorithm | Class (Python) | Task | MOJO |
|---|---|---|---|
| Gradient Boosting Machine | H2OGradientBoostingEstimator | Regression, Classification | Yes |
| XGBoost | H2OXGBoostEstimator | Regression, Classification | Yes |
| Distributed Random Forest | H2ORandomForestEstimator | Regression, Classification | Yes |
| Deep Learning | H2ODeepLearningEstimator | Regression, Classification | Yes |
| Generalized Linear Model | H2OGeneralizedLinearEstimator | Regression, Classification | Yes |
| Generalized Additive Model | H2OGeneralizedAdditiveEstimator | Regression, Classification | Yes |
| Stacked Ensembles | H2OStackedEnsembleEstimator | Regression, Classification | Yes |
| K-Means | H2OKMeansEstimator | Clustering | Export only |
| PCA | H2OPrincipalComponentAnalysisEstimator | Dimensionality Reduction | Export only |
| Naive Bayes | H2ONaiveBayesEstimator | Classification | Yes |
| Isolation Forest | H2OIsolationForestEstimator | Anomaly Detection | Yes |
| AutoML | H2OAutoML | All supervised | — |
## Choosing the Right Algorithm

### By Problem Type

#### Regression

For predicting a continuous numeric value:
- GBM / XGBoost — best general-purpose accuracy; XGBoost can be faster on large tabular data with GPU support.
- GLM — best when you need a linear, interpretable model or a regularized baseline (elastic net with Gaussian family).
- Deep Learning — useful for very large datasets or complex feature interactions, but requires more tuning.
- DRF — strong out-of-the-box baseline; naturally handles missing values and mixed types.
- AutoML — try all of the above automatically and rank by RMSE/deviance.
### By Data Characteristics
| Characteristic | Recommended Algorithm(s) |
|---|---|
| Small dataset (< 10k rows) | GLM, DRF, GBM |
| Large dataset (> 1M rows) | XGBoost (GPU), GBM, DRF |
| Many categorical features | GBM (native enum encoding), DRF |
| Sparse / high-dimensional data | GLM (lasso), Deep Learning |
| Time-series / sequential | GBM with monotone constraints |
| Need fast inference (MOJO) | GBM, XGBoost, DRF, GLM |
| Need model explainability | GLM, GBM (SHAP contributions), DRF (variable importance) |
| Missing values | GBM, DRF (handle natively) |
### By Interpretability Need
H2O-3 supports SHAP (SHapley Additive exPlanations) contributions for tree-based models (GBM, XGBoost, DRF) and variable importance for all supervised models. Use `model.explain()` on any trained model or AutoML object for an automatic explainability report.

| Interpretability Level | Algorithms |
|---|---|
| Fully interpretable | GLM, GAM |
| Variable importance | GBM, XGBoost, DRF, Deep Learning |
| SHAP contributions | GBM, XGBoost, DRF |
| Black-box | Deep Learning, Stacked Ensembles |
## Algorithm Pages

- **AutoML**: Automatically train and rank multiple models with a single function call.
- **GBM & XGBoost**: Gradient boosted trees, covering H2O's native GBM and the XGBoost backend.
- **Distributed Random Forest**: Bagged ensembles of decision trees with column and row subsampling.
- **Deep Learning**: Multi-layer feedforward neural networks with adaptive learning rates.
- **GLM & GAM**: Regularized linear models (elastic net) and generalized additive models.
- **Stacked Ensembles**: Super-learner ensembles that combine cross-validated base model predictions.
- **Clustering & Dimensionality Reduction**: K-Means clustering and PCA for unsupervised learning.