Overview

The export_onnx.py script converts trained scikit-learn models to ONNX format using skl2onnx, enabling deployment in non-Python environments and cross-platform inference runtimes.

Prerequisites

1. Train the model

Ensure artifacts/best_model.joblib exists:

python -m src.train

2. Verify config.yaml

Check that artifact paths are configured:

artifacts:
  model_dir: artifacts
  model_file: best_model.joblib
  threshold_file: threshold.txt

Usage

python deployment/export_onnx.py
No command-line arguments are required; configuration is read from config.yaml.

How It Works

1. Load Trained Model

The script loads the scikit-learn pipeline from the artifact path specified in config.yaml:
deployment/export_onnx.py
config = load_config("config.yaml")
model_path = Path(config["artifacts"]["model_dir"]) / config["artifacts"]["model_file"]
model = joblib.load(model_path)

2. Infer Input Schema

Automatic type inference from test data determines ONNX input types:
deployment/export_onnx.py
from skl2onnx.common.data_types import FloatTensorType, StringTensorType

def _build_initial_types(df):
    initial_types = []
    for col in df.columns:
        kind = df[col].dtype.kind
        if kind in {"i", "u", "b", "f"}:  # numeric
            initial_types.append((col, FloatTensorType([None, 1])))
        else:  # categorical
            initial_types.append((col, StringTensorType([None, 1])))
    return initial_types
  • Numeric features (int, uint, bool, float) → FloatTensorType
  • Categorical features (object, string) → StringTensorType
  • Batch dimension is dynamic (None) to support variable-length inputs
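The dtype-kind dispatch above can be illustrated without skl2onnx installed. In this sketch the helper returns type *names* instead of skl2onnx objects, and the column names are made up:

```python
import pandas as pd

# Sketch of _build_initial_types' dtype dispatch; returns type names so it
# runs without skl2onnx installed.
def infer_onnx_type_name(series: pd.Series) -> str:
    # numpy dtype kinds: i/u = int, b = bool, f = float; everything else
    # (object, string, category) is treated as categorical
    return "FloatTensorType" if series.dtype.kind in {"i", "u", "b", "f"} else "StringTensorType"

df = pd.DataFrame({
    "age": [42, 57],                  # int64   -> FloatTensorType
    "income": [50_000.0, 62_500.0],   # float64 -> FloatTensorType
    "city": ["Oslo", "Bergen"],       # object  -> StringTensorType
})
type_names = {col: infer_onnx_type_name(df[col]) for col in df.columns}
```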

3. Convert with skl2onnx

The pipeline is converted to ONNX with target opset version 15:
deployment/export_onnx.py
from skl2onnx import to_onnx

initial_types = _build_initial_types(X_test)
onx = to_onnx(model, initial_types=initial_types, target_opset=15)
Target opset 15 is compatible with ONNX Runtime 1.10+ and provides stable operator support for common scikit-learn estimators.

4. Serialize and Save Artifacts

Three files are written to artifacts/:
deployment/export_onnx.py
(out_dir / "model.onnx").write_bytes(onx.SerializeToString())
(out_dir / "onnx_features.json").write_text(
    json.dumps(X_test.columns.tolist(), indent=2)
)
(out_dir / "onnx_metadata.json").write_text(
    json.dumps({"threshold": threshold, "model_source": str(model_path)}, indent=2)
)

Output Artifacts

model.onnx: Binary ONNX graph containing the full inference pipeline.
  • Includes preprocessing transformers, imputers, encoders, and estimator
  • Can be loaded by any ONNX Runtime (Python, C++, C#, Java, JavaScript)
  • Typical size: 10-100 KB for tree ensembles, larger for neural networks

onnx_features.json: Feature schema listing the expected input column names in order.

[
  "feature_1",
  "feature_2",
  "feature_3"
]

Use this to validate input data before inference.
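A minimal validation sketch (the schema string is inlined here for illustration; in practice load it from artifacts/onnx_features.json, and the helper name is ours, not the script's):

```python
import json

def validate_columns(columns, schema_json):
    """Raise if columns are missing or unexpected; return True only when order also matches."""
    expected = json.loads(schema_json)
    missing = [c for c in expected if c not in columns]
    extra = [c for c in columns if c not in expected]
    if missing or extra:
        raise ValueError(f"schema mismatch: missing={missing}, extra={extra}")
    return list(columns) == expected

schema = '["feature_1", "feature_2", "feature_3"]'
ordered_ok = validate_columns(["feature_1", "feature_2", "feature_3"], schema)
reordered = validate_columns(["feature_3", "feature_1", "feature_2"], schema)
```

If the names match but the order does not, reindexing the DataFrame with the expected list (`df[expected]`) restores the exported order.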
onnx_metadata.json: Model metadata capturing the decision threshold and provenance.

{
  "threshold": 0.3826,
  "model_source": "artifacts/best_model.joblib"
}

  • threshold: Optimal decision boundary from training
  • model_source: Path to the original scikit-learn artifact
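At inference time the threshold converts ONNX probability outputs into class labels. A sketch, with the metadata inlined for illustration (in practice read artifacts/onnx_metadata.json) and made-up probabilities:

```python
import json

# Metadata inlined for illustration; in practice read artifacts/onnx_metadata.json.
metadata = json.loads('{"threshold": 0.3826, "model_source": "artifacts/best_model.joblib"}')

probs = [0.12, 0.39, 0.91]  # hypothetical positive-class probabilities from ONNX Runtime
labels = [int(p >= metadata["threshold"]) for p in probs]  # -> [0, 1, 1]
```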

Expected Output

$ python deployment/export_onnx.py
Wrote artifacts/model.onnx, onnx_features.json, onnx_metadata.json

Conversion Process Details

Supported Pipeline Components

  • SimpleImputer
  • StandardScaler, MinMaxScaler, RobustScaler
  • OneHotEncoder, OrdinalEncoder, LabelEncoder
  • PolynomialFeatures
  • FunctionTransformer (limited support)
Custom transformers not recognized by skl2onnx will cause conversion failures. Implement a custom converter or refactor to use supported components.

Operator Compatibility

The conversion targets ONNX opset 15, which provides:
  • Converters for the standard scikit-learn estimators (tree models, linear models, SVMs)
  • String processing for categorical features
  • ZipMap for dictionary outputs (probability dicts)

Troubleshooting

Missing model artifact

Cause: artifacts/best_model.joblib not found.
Solution:

python -m src.train
Conversion failure

Cause: Pipeline contains an unsupported custom transformer.
Solution:
  1. Check the skl2onnx supported operators
  2. Replace the custom transformer with a supported equivalent
  3. Or implement a custom converter
Input shape or type mismatch

Cause: Feature dtype inference failed or preprocessing changed.
Solution:
  1. Verify X_test columns match the model's expected features
  2. Check for NaN or infinite values in the test data
  3. Ensure categorical features are properly encoded
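Step 2 above can be scripted. A quick finiteness check over the numeric columns (the toy data and column names here are illustrative):

```python
import numpy as np
import pandas as pd

# Flag numeric columns containing NaN or +/-inf before export (toy data).
X_test = pd.DataFrame({
    "feature_1": [1.0, 2.0, 3.0],
    "feature_2": [0.5, np.nan, np.inf],
    "feature_3": ["a", "b", "c"],  # non-numeric columns are skipped
})

numeric = X_test.select_dtypes(include="number")
bad_columns = [col for col in numeric.columns if not np.isfinite(numeric[col]).all()]
```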
ONNX predictions differ from scikit-learn

Cause: Numerical precision differences or preprocessing mismatch.
Solution: Run parity validation to diagnose:

python deployment/parity_check.py --abs-tol 0.04 --mean-tol 0.01

Verification

After export, verify the ONNX model loads correctly:
import onnxruntime as ort

sess = ort.InferenceSession("artifacts/model.onnx", providers=["CPUExecutionProvider"])
print("Input names:", [inp.name for inp in sess.get_inputs()])
print("Output names:", [out.name for out in sess.get_outputs()])
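To actually run the session, each column must be fed as a 2-D single-column array matching the (col, [None, 1]) schema. A feed-dict sketch that runs without an ONNX model (column names are illustrative):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"age": [42.0, 57.0], "city": ["Oslo", "Bergen"]})

# Build the per-column feed dict that sess.run(None, feed) expects: numeric
# columns as float32, categorical columns as object arrays, each (n, 1).
feed = {}
for col in df.columns:
    if df[col].dtype.kind in {"i", "u", "b", "f"}:
        feed[col] = df[col].to_numpy(dtype=np.float32).reshape(-1, 1)
    else:
        feed[col] = df[col].to_numpy(dtype=object).reshape(-1, 1)
```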

Next Steps

Quantization

Optimize the exported model with INT8 quantization

Parity Validation

Validate numerical agreement with scikit-learn
