LibXMTP includes comprehensive benchmarks to measure and track the performance of critical operations.

Available Benchmarks

The xmtp_mls crate includes several benchmark suites:
  • group_limit: Benchmarks for adding and removing members from groups at or near the member limit
  • crypto: Benchmarks for cryptographic functions
  • identity: Benchmarks for identity operations
  • groups: Benchmarks for group operations
  • messages: Benchmarks for message handling
  • consent: Benchmarks for consent operations
  • sync_conversations: Benchmarks for conversation synchronization

Running Benchmarks

Run All Benchmarks

The simplest way to run all benchmarks:
./dev/bench

Run a Specific Benchmark

Run a single named benchmark:
./dev/bench add_1_member_to_group

Run a Benchmark Category

Run all benchmarks in a specific category:
cargo bench --features bench -p xmtp_mls --bench group_limit
All benchmark commands require the bench feature flag.
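
Criterion also accepts its own options after a `--` separator. For example, assuming the default Criterion harness these suites use, you can list the benchmark names in a suite without running them:

```shell
# List benchmark names in the group_limit suite without running them.
cargo bench --features bench -p xmtp_mls --bench group_limit -- --list
```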

Benchmark Categories

Group Limit Benchmarks

Test group performance with varying member counts:
cargo bench --features bench -p xmtp_mls --bench group_limit

Crypto Benchmarks

Benchmark cryptographic operations:
cargo bench --features bench -p xmtp_mls --bench crypto

Identity Benchmarks

Benchmark identity-related operations:
cargo bench --features bench -p xmtp_mls --bench identity

Groups Benchmarks

Benchmark general group operations:
cargo bench --features bench -p xmtp_mls --bench groups

Messages Benchmarks

Benchmark message sending and processing:
cargo bench --features bench -p xmtp_mls --bench messages

Consent Benchmarks

Benchmark consent management:
cargo bench --features bench -p xmtp_mls --bench consent

Sync Conversations Benchmarks

Benchmark conversation synchronization:
cargo bench --features bench -p xmtp_mls --bench sync_conversations

Running Against Dev gRPC

To run benchmarks against the development gRPC server:
DEV_GRPC=1 cargo bench --features bench -p xmtp_mls --bench group_limit
Make sure the development gRPC server is running before using DEV_GRPC=1.

Profiling with Flamegraphs

Generate a flamegraph to visualize performance bottlenecks:
./dev/flamegraph add_1_member_to_group
This creates a visual representation of where time is spent during benchmark execution.

Benchmark Features

Benchmarks are gated behind the bench feature, which includes:
  • Test utilities
  • Progress indicators (via indicatif)
  • Tracing and logging
  • Criterion for benchmark framework
  • File descriptor limit management (via fdlimit)
  • Performance optimization tools
From Cargo.toml:
bench = [
  "test-utils",
  "indicatif",
  "tracing-subscriber",
  "criterion",
  "dep:fdlimit",
  "dep:alloy",
  "xmtp_common/bench",
]

Understanding Benchmark Results

Criterion produces detailed output including:
  • Time: Mean execution time with confidence intervals
  • Throughput: Operations per second (where applicable)
  • Change: Performance change compared to previous runs
  • Outliers: Statistical outliers in the measurements
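
As a toy illustration of the first bullet, the mean over a set of sample times can be computed with a one-liner (Criterion's real estimate is more sophisticated, using bootstrap resampling to derive confidence intervals; the sample values here are made up):

```shell
# Mean of five hypothetical sample times in nanoseconds.
printf '100\n102\n98\n101\n99\n' | awk '{ s += $1; n += 1 } END { printf "mean: %.1f ns\n", s / n }'
# prints "mean: 100.0 ns"
```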

Best Practices

  1. Close unnecessary applications: Ensure consistent results by closing resource-intensive applications before running benchmarks.
  2. Run multiple iterations: Criterion automatically runs multiple iterations, but you can increase the iteration count for more stable results.
  3. Use flamegraphs for optimization: When optimizing, use flamegraphs to identify bottlenecks:
     ./dev/flamegraph your_benchmark_name
  4. Compare against a baseline: Criterion saves baseline results to compare against future runs, helping you track performance regressions.
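
Baselines can also be managed explicitly through Criterion's CLI options, passed after `--` (the baseline name here is illustrative):

```shell
# Save the current results under a named baseline...
cargo bench --features bench -p xmtp_mls --bench group_limit -- --save-baseline before-change
# ...then, after making changes, compare a new run against it.
cargo bench --features bench -p xmtp_mls --bench group_limit -- --baseline before-change
```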

Continuous Integration

Benchmarks can be run in CI to track performance over time. The results help identify performance regressions before they reach production.

Troubleshooting

File Descriptor Limits

If you encounter file descriptor limit errors, the fdlimit dependency (included with the bench feature) should automatically handle this. If issues persist, manually increase your system’s file descriptor limit.
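
For example, on Linux or macOS you can inspect and raise the soft limit for the current shell session before running benchmarks (4096 is an arbitrary example value):

```shell
# Show the current soft file descriptor limit for this shell.
ulimit -n
# Raise it for this session (capped by the hard limit, shown by `ulimit -Hn`).
ulimit -n 4096
```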

Memory Issues

For large-scale benchmarks (like group_limit with many members), ensure you have sufficient memory available. Consider running benchmarks individually rather than all at once.

Inconsistent Results

If benchmark results are inconsistent:
  1. Close background applications
  2. Disable CPU frequency scaling (if possible)
  3. Run benchmarks multiple times and look for patterns
  4. Check for system resource contention
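
For step 2, on Linux the CPU frequency governor can be pinned to reduce frequency-scaling noise; a minimal sketch, assuming the `cpupower` utility is installed and you have root access:

```shell
# Pin the CPU frequency governor to "performance" while benchmarking.
sudo cpupower frequency-set -g performance
```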
