vLLM relies on a model registry to determine how to run each model. This page provides detailed instructions on how to register your model. A list of pre-registered architectures can be found in the supported models documentation.

Registration options

There are two ways to register a model with vLLM:
  1. Built-in models: Add your model directly to the vLLM library
  2. Out-of-tree models: Load an external model using a plugin

Built-in models

To add a model directly to the vLLM library:
1. Fork and build vLLM

Start by forking the vLLM GitHub repository and building it from source. This gives you the ability to modify the codebase and test your model.
2. Implement your model

After implementing your model (see adding models guide), place it in the vllm/model_executor/models directory.
3. Register in the model registry

Add your model class to _VLLM_MODELS in vllm/model_executor/models/registry.py so that it is automatically registered upon importing vLLM:
vllm/model_executor/models/registry.py
_VLLM_MODELS = {
    "AquilaForCausalLM": "aquila",
    "BaiChuanForCausalLM": "baichuan",
    # ... other models ...
    "YourModelForCausalLM": "your_model",  # Add your model here
}
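The mapping keys each architecture name to a module under vllm/model_executor/models, so model classes can be imported only when they are actually requested. A minimal sketch of that lazy-lookup pattern (illustrative only, not vLLM's actual registry code — stdlib names stand in for model modules so the sketch runs):

```python
import importlib

# Hypothetical mapping in the style of _VLLM_MODELS:
# architecture name -> (module path, class name).
_MODELS = {
    "OrderedDict": ("collections", "OrderedDict"),
}

def resolve_model_cls(arch: str):
    """Import the module lazily and return the registered class."""
    module_name, cls_name = _MODELS[arch]
    module = importlib.import_module(module_name)  # import happens here, on demand
    return getattr(module, cls_name)

cls = resolve_model_cls("OrderedDict")
print(cls.__name__)  # -> OrderedDict
```

Because the import is deferred until lookup, registering many architectures stays cheap even when most of them are never used in a given process.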
4. Update documentation

Update the list of supported models to promote your model.
The list of models in each section should be maintained in alphabetical order.

Out-of-tree models

You can load an external model using a plugin without modifying the vLLM codebase. This is useful for:
  • Proprietary models that cannot be open-sourced
  • Experimental models still under development
  • Custom modifications to existing models

Creating a plugin

1. Create the plugin structure

Create a Python package with the following structure:
my_vllm_plugin/
├── my_vllm_plugin/
│   ├── __init__.py
│   └── my_model.py
└── setup.py
2. Implement the model

In my_vllm_plugin/my_model.py, implement your model:
my_vllm_plugin/my_model.py
import torch.nn as nn
from vllm.config import VllmConfig

class YourModelForCausalLM(nn.Module):
    def __init__(self, *, vllm_config: VllmConfig, prefix: str = ""):
        super().__init__()
        # Your model implementation
        ...
3. Create a registration function

In my_vllm_plugin/__init__.py, create the registration entry point:
# The entrypoint of your plugin
def register():
    from vllm import ModelRegistry
    from my_vllm_plugin.my_model import YourModelForCausalLM

    ModelRegistry.register_model(
        "YourModelForCausalLM",
        YourModelForCausalLM
    )
4. Configure setup.py

In setup.py, register the plugin entry point:
setup.py
from setuptools import setup, find_packages

setup(
    name="my_vllm_plugin",
    version="0.1.0",
    packages=find_packages(),
    entry_points={
        "vllm.general_plugins": [
            "register_my_model = my_vllm_plugin:register"
        ]
    },
)
5. Install the plugin

Install your plugin package:
pip install -e ./my_vllm_plugin
6. Use your model

Your model will now be automatically discovered and registered by vLLM:
from vllm import LLM

# vLLM automatically loads your plugin
llm = LLM(model="path/to/your/model")

Multimodal models

If your model accepts multimodal inputs, ensure the model class implements the SupportsMultiModal interface:
from vllm.model_executor.models.interfaces import SupportsMultiModal

class YourModelForImage2Seq(nn.Module, SupportsMultiModal):
    ...
Read more about multimodal support in the multimodal guide.

Plugin system details

vLLM’s plugin system uses the standard Python entry_points mechanism. Every plugin has three parts:
  1. Plugin group: The name of the entry point group (e.g., vllm.general_plugins)
  2. Plugin name: The name of the plugin (e.g., register_my_model)
  3. Plugin value: The fully qualified name of the function or module to register
For more information on plugins, see the plugin system design.

Example: Complete plugin

Here's the setup.py for a complete plugin that registers a custom Llama variant (the package layout and register() function follow the structure shown above):
from setuptools import setup, find_packages

setup(
    name="vllm_custom_llama",
    version="0.1.0",
    packages=find_packages(),
    install_requires=[
        "vllm>=0.4.0",
    ],
    entry_points={
        "vllm.general_plugins": [
            "register_custom_llama = vllm_custom_llama:register"
        ]
    },
)
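If you prefer declarative packaging metadata, the same entry point can be expressed in pyproject.toml instead of setup.py — this is the equivalent setuptools configuration for the example above:

```toml
[project]
name = "vllm_custom_llama"
version = "0.1.0"
dependencies = ["vllm>=0.4.0"]

[project.entry-points."vllm.general_plugins"]
register_custom_llama = "vllm_custom_llama:register"
```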

Next steps

  • Testing guide: Write tests to verify your model
  • Plugin system: Learn more about the plugin system
