Accounts Database - Harmonic Salsa

The Accounts Database is the persistent storage layer for all account data in Harmonic Salsa. It provides high-performance storage, indexing, and snapshot capabilities.

Overview

From ~/workspace/source/accounts-db/src/accounts_db.rs:1:

/// Persistent accounts are stored at this path location:
///  `<path>/<pid>/data/`
///
/// The persistent store would allow for this mode of operation:
///  - Concurrent single thread append with many concurrent readers.
///
/// The underlying memory is memory mapped to a file.

Architecture Components

The accounts database consists of several key components:

Core Structures

pub mod account_info;
pub mod account_storage;
pub mod accounts_db;
pub mod accounts_file;
pub mod accounts_hash;
pub mod accounts_index;
pub mod accounts_cache;
pub mod append_vec;

Storage Architecture

Append-Only Storage

Accounts stored in append-only files (AppendVec):

Memory-mapped files for fast access
Immutable once written
Multiple files per slot
Accounts never modified in place

Storage Layout

<data_dir>/
├── accounts/
│   ├── 0.0      # Storage file for slot 0
│   ├── 0.1      # Additional storage for slot 0
│   ├── 1.0      # Storage file for slot 1
│   └── ...
├── snapshots/
│   ├── 12345/   # Full snapshot at slot 12345
│   └── 12346/   # Incremental snapshot
└── accounts_index/
    └── ...      # Index data

AppendVec Format

Each AppendVec contains:

// Header with metadata
// Followed by account entries:
// - Lamports (u64)
// - Data length (u64)  
// - Owner (Pubkey)
// - Executable flag (bool)
// - Rent epoch (u64)
// - Account data (bytes)
// - Padding for alignment

Storage Recycling

Old storage files recycled when:

Dead Accounts - No live accounts remain
Low Occupancy - Most accounts updated elsewhere
Ancient Storage - Very old storage files
Compaction - Combined with other storage

Account Indexing

The accounts index (~/workspace/source/accounts-db/src/accounts_index.rs:1) maps pubkeys to storage locations.

Index Structure

pub struct AccountsIndex<T: IndexValue> {
    // In-memory index of account locations
    // Maps: Pubkey -> SlotList<AccountInfo>
    // SlotList: [(Slot, AccountInfo), ...]
}

Account Info

pub struct AccountInfo {
    // Storage ID
    // Offset within storage
    // Lamports (cached)
}

Key properties:

Multiple slots per pubkey (fork history)
Sorted by slot number
Most recent slot typically used
Ref count for garbage collection

Index Sharding

Index sharded for parallelism:

pub const BINS_DEFAULT: usize = 8192;
// Shard index across 8192 bins
// Reduces lock contention
// Enables parallel operations

Disk Index

Index can spill to disk:

In-Memory - Hot accounts in RAM
On-Disk - Cold accounts on disk
LRU Policy - Keep recently used in memory
Async Loading - Load from disk as needed

Account Cache

Recently modified accounts cached in memory:

pub struct AccountsCache {
    // Map of slot -> accounts modified in that slot
    // Accounts not yet flushed to disk
    // Write-through cache behavior
}

Cache Operations

Store - Write to cache, async flush to disk
Load - Check cache first, then disk
Flush - Write cached accounts to AppendVec
Evict - Remove from cache after flush

Write Cache

Default cache size (~/workspace/source/accounts-db/src/accounts_db.rs:98):

const WRITE_CACHE_LIMIT_BYTES_DEFAULT: u64 = 15_000_000_000;
// 15 GB write cache
// Configurable via --accounts-db-cache-limit-mb

Account Hashing

Accounts hashed for verification:

Hash Computation

// For each account:
hash = hash(
    lamports,
    owner,
    executable,
    rent_epoch,
    data,
    pubkey
)

Lattice Hash (LtHash)

Used for incremental hashing:

Homomorphic hash function
Add/remove accounts without rehashing
Efficient for large account sets
Used in snapshots

Accounts Hash

Full accounts hash computed for:

Snapshot creation
Cluster consensus on state
State verification
Fork comparison

Snapshot System

Full Snapshots

Complete state at a slot:

Freeze Bank - No more modifications
Hash Accounts - Compute full hash
Write Storage - Copy AppendVec files
Write Metadata - Bank state and hash
Compress Archive - Create tar.zst file

Incremental Snapshots

Changes since last full snapshot:

Only modified accounts
Much smaller than full snapshot
Faster to create and load
Chain with full snapshot

Snapshot Loading

Download - Get snapshot from peer or disk
Decompress - Extract archive
Load Accounts - Mmap storage files
Build Index - Scan storage and index
Verify Hash - Ensure correctness
Resume - Continue from snapshot slot

Background Services

Accounts Background Service

Runs cleanup operations:

Snapshot Generation - Create periodic snapshots
Shrink Storage - Compact underutilized storage
Clean Accounts - Remove old versions
Purge Slots - Delete rooted-out forks

Clean Process

Remove old account versions:

Identify Dead - Find superseded accounts
Remove from Index - Update index
Mark Storage - Track dead bytes
Schedule Shrink - If occupancy low

Shrink Process

Compact sparse storage:

Select Candidates - Find low-occupancy storage
Read Alive Accounts - Load still-referenced accounts
Write New Storage - Append to new file
Update Index - Point to new locations
Delete Old Storage - Remove old file

Ancient Append Vecs

Very old storage files:

Combined into larger files
Reduces file count
Improves space efficiency
Special handling for durability

Read-Only Accounts Cache

Cache for read-only accounts:

pub struct ReadOnlyAccountsCache {
    // LRU cache of read-only accounts
    // Reduces disk I/O for popular programs
    // Configurable size limit
}

Benefits:

Faster program loading
Reduced disk contention
Better performance for hot programs

Account Locks

Transactions lock accounts during execution:

pub mod account_locks;
// Validates no read/write conflicts
// Allows parallel execution of non-conflicting txs
// Prevents race conditions

Lock Rules

Write Lock - Exclusive access
Read Lock - Shared access
Validation - Before transaction execution
Release - After batch complete

Scanning Accounts

Scan Operations

Iterate over all accounts:

pub fn scan_accounts<F>(
    &self,
    ancestors: &Ancestors,
    scan_func: F,
) where F: Fn(&Pubkey, (&T, Slot))

Use cases:

Snapshot creation
Hash computation
Rent collection
Account analysis

Parallel Scanning

Scan in parallel using shards:

const SCAN_SLOT_PAR_ITER_THRESHOLD: usize = 4000;
// Parallel scan if >= 4000 accounts

Performance Optimizations

Memory Mapping

Zero-copy account reads
Kernel manages caching
Efficient for large datasets
OS handles page faults

Parallel Operations

Sharded index structure
Parallel account scanning
Concurrent storage access
Lock-free where possible

Cache Hierarchy

Write Cache - Recent modifications
Read Cache - Popular read-only accounts
OS Page Cache - Memory-mapped files
Disk - Persistent storage

Batch Processing

Flush multiple accounts together
Amortize I/O overhead
Better disk utilization
Reduced write amplification

Tiered Storage

Optimized storage for different access patterns:

pub mod tiered_storage;
// Hot tier: Frequently accessed
// Cold tier: Rarely accessed
// Archive tier: Ancient data

Configuration

AccountsDbConfig

pub struct AccountsDbConfig {
    // Number of index bins
    pub bins: Option<usize>,
    
    // Number of flush threads
    pub num_flush_threads: Option<NonZeroUsize>,
    
    // Storage paths
    pub drives: Option<Vec<PathBuf>>,
    
    // Index size limit
    pub index_limit_mb: IndexLimitMb,
}

Testing Config

pub const ACCOUNTS_DB_CONFIG_FOR_TESTING: AccountsDbConfig = AccountsDbConfig {
    bins: Some(BINS_FOR_TESTING),
    num_flush_threads: Some(FLUSH_THREADS_TESTING),
    drives: None,
    index_limit_mb: IndexLimitMb::InMemOnly,
    // ...
};

Error Handling

Storage Errors

Disk Full - No space for new storage
Corruption - Storage file damaged
I/O Error - Disk read/write failure

Recovery Strategies

Retry operations
Load from snapshot
Reconstruct from ledger
Request from peers

Monitoring

Metrics Tracked

Storage usage per slot
Cache hit/miss rates
Clean/shrink performance
Scan latency
Flush time

Stats Reporting

pub struct AccountsStats {
    // Detailed statistics
    // Performance metrics
    // Resource utilization
}

Key Files

Accounts DB: ~/workspace/source/accounts-db/src/accounts_db.rs
Accounts Index: ~/workspace/source/accounts-db/src/accounts_index.rs
Accounts Cache: ~/workspace/source/accounts-db/src/accounts_cache.rs
Account Storage: ~/workspace/source/accounts-db/src/account_storage.rs
Append Vec: ~/workspace/source/accounts-db/src/append_vec.rs

Get Started

Validator

CLI Tools

Architecture

​Overview

​Architecture Components

​Core Structures

​Storage Architecture

​Append-Only Storage

​Storage Layout

​AppendVec Format

​Storage Recycling

​Account Indexing

​Index Structure

​Account Info

​Index Sharding

​Disk Index

​Account Cache

​Cache Operations

​Write Cache

​Account Hashing

​Hash Computation

​Lattice Hash (LtHash)

​Accounts Hash

​Snapshot System

​Full Snapshots

​Incremental Snapshots

​Snapshot Loading

​Background Services

​Accounts Background Service

​Clean Process

​Shrink Process

​Ancient Append Vecs

​Read-Only Accounts Cache

​Account Locks

​Lock Rules

​Scanning Accounts

​Scan Operations

​Parallel Scanning

​Performance Optimizations

​Memory Mapping

​Parallel Operations

​Cache Hierarchy

​Batch Processing

​Tiered Storage

​Configuration

​AccountsDbConfig

​Testing Config

​Error Handling

​Storage Errors

​Recovery Strategies

​Monitoring

​Metrics Tracked

​Stats Reporting

​Key Files

Build docs developers (and LLMs) love