Introduction to Remote Runners

A BuildBuddy remote runner is an execution environment that runs on a BuildBuddy executor. Our remote runners are optimized to run Bazel commands, and allow users to maintain a warm bazel instance in a secure execution environment managed by BuildBuddy. For example, this might look like a Firecracker microVM or an OCI container where you can run Bazel commands.

Key advantages

BuildBuddy remote runners have the following unique advantages:

Fast Network Connection

Colocation with BuildBuddy servers ensures sub-millisecond round trip times between Bazel and BuildBuddy’s cache & RBE servers.

Persistent Execution

Running workloads in persistent execution environments using microVM snapshotting (on Linux) and persistent runners (on macOS) allows reusing Bazel’s in-memory analysis cache and local disk cache.

Products built on remote runners

There are two ways to use remote runners:

BuildBuddy Workflows

Our continuous integration (CI) solution that runs Bazel builds and tests in response to git events (pull requests or pushes).

Remote Bazel

A CLI tool that works exactly like the Bazel command, but runs Bazel on a remote workspace and streams the output back to the local machine.

See our blog post for more details on the motivation behind remote runners as well as some real-world results.

Differences between Workflows and Remote Bazel

In many ways, Remote Bazel and Workflows are the same product and share much of the same backend code. Both are mechanisms to run code on remote runners. The primary difference is the entrypoint.

Workflows
Remote Bazel

Workflows are configured with a config YAML that is checked in to GitHub. Remote runs can be automatically triggered by GitHub events, like push and pull events. Workflows are commonly used as a Continuous Integration (CI) solution.Workflows are a good fit if you:

Have a static list of commands to run

Want your commands checked in to your codebase for review

Are exclusively using BuildBuddy to run CI and do not have another CI provider that can initiate commands

Benefits of remote runners

Colocation with BuildBuddy servers

Network latency is often the biggest bottleneck in many Bazel Remote Build Execution and Remote Caching setups. This is because Bazel’s remote APIs require several chained RPCs due to dependencies between actions. To address this bottleneck, BuildBuddy remote runners are executed in the same datacenters where BuildBuddy RBE and Cache nodes are deployed. This results in sub-millisecond round trip times to BuildBuddy’s servers, minimizing the overhead incurred by Bazel’s remote APIs.

Hosted, warm, Bazel instances

Running Bazel on most CI solutions is typically expensive and slow. There are several sources of overhead:

Bazel download overhead

When using Bazelisk, Bazel itself is re-downloaded and extracted on each CI run.

Cold JVM start

The Bazel server starts from a cold JVM, meaning that it will be running unoptimized code until the JIT compiler kicks in.

Empty analysis cache

Bazel’s analysis cache starts empty, which often means the entire workspace has to be re-scanned on each CI run.

Repository re-fetching

Any remote repositories referenced by the Bazel workspace all have to be re-fetched on each run.

Empty disk cache

Bazel’s on-disk cache starts completely empty, causing action re-execution or excess remote cache usage.

A common solution is to use something like actions/cache to store Bazel’s cache for reuse between runs, but this solution is extremely data-intensive, as Bazel’s cache can be several GB in size and consist of many individual files which are expensive to unpack from an archive. It also does not solve the problems associated with the Bazel server having starting from scratch. By contrast, BuildBuddy uses a Bazel workspace reuse approach, similar to how Google’s Build Dequeuing Service performs workspace selection:

A well-chosen workspace can increase the build speed by an order of magnitude by reusing the various cached results from the previous execution. […] We have observed that builds that execute the same targets as a previous build are effectively no-ops using this technique

Runner recycling

To match workloads to warm runners, BuildBuddy uses VM snapshotting powered by Firecracker on Linux, and a simpler runner-recycling based approach on macOS.

Firecracker VMs (Linux)
Runner Recycling (macOS)

On Linux, remote runs are executed inside Firecracker VMs, which have a low startup time (hundreds of milliseconds). VM snapshots include the full disk and memory contents of the machine, meaning that the Bazel server is effectively kept warm between runs.Remote runners use a sophisticated snapshotting mechanism that minimizes the work that Bazel has to do on each CI run.First, VM snapshots can be stored both locally on the machine that ran the remote run, as well as remotely in BuildBuddy’s cache. This way, if the original machine that ran the remote run is fully occupied with other workloads, subsequent runs can be executed on another machine, but still be able to resume from a warm VM snapshot. BuildBuddy stores VM snapshots in granular chunks that are downloaded lazily, so that unneeded disk and memory chunks are not re-downloaded.Second, snapshots are stored using a branching model that closely mirrors the branching structure of the git repository itself, allowing remote runs to be matched optimally to VM snapshots.

For more technical details on our VM implementation, see our BazelCon talk Reusing Bazel’s Analysis Cache by Cloning Micro-VMs.

Optimal usage of remote runners

While our remote runners support running arbitrary bash code, they were specifically designed and optimized to run the Bazel client server and power Bazel commands with remote execution (RBE).

Because we snapshot the entire memory and disk of each runner, the product is slower when we must serialize and deserialize larger runners. Thus we have a 100GB limit on disks for remote runners. It’s more effective when a smaller remote runner is used to orchestrate farming out most computation to traditional Bazel remote executors.

Getting started

You can get started with BuildBuddy Workflows or Remote Bazel by checking out the corresponding docs.

If you’ve already linked your GitHub account to BuildBuddy, it’ll only take about 30 seconds to enable remote runners for your repo—just select a repo to link, and we’ll take care of the rest!

Get Started

Products

Features

Guides

Deployment

Contributing

Introduction to Remote Runners

Key advantages

Fast Network Connection

Persistent Execution

Products built on remote runners

BuildBuddy Workflows

Remote Bazel

Differences between Workflows and Remote Bazel

Benefits of remote runners

Colocation with BuildBuddy servers

Hosted, warm, Bazel instances

Runner recycling

Optimal usage of remote runners

Getting started

Build docs developers (and LLMs) love

Get Started

Products

Features

Guides

Deployment

Contributing

​Key advantages

Fast Network Connection

Persistent Execution

​Products built on remote runners

BuildBuddy Workflows

Remote Bazel

​Differences between Workflows and Remote Bazel

​Benefits of remote runners

​Colocation with BuildBuddy servers

​Hosted, warm, Bazel instances

​Runner recycling

​Optimal usage of remote runners

​Getting started

Build docs developers (and LLMs) love

Key advantages

Products built on remote runners

Differences between Workflows and Remote Bazel

Benefits of remote runners

Colocation with BuildBuddy servers

Hosted, warm, Bazel instances

Runner recycling

Optimal usage of remote runners

Getting started