What is S2?
S2 is designed to handle real-time data streams with guaranteed durability and ordering. Unlike traditional message queues or streaming platforms, S2 uses object storage (such as AWS S3) as its storage backend, providing:
- Serverless scalability: No servers to manage; capacity scales automatically
- Durable-first design: Data is always durable on object storage before being acknowledged
- Simple API: HTTP-based REST API and streaming protocols
- Cost-effective: Leverages inexpensive object storage for long-term retention
Core concepts
S2’s architecture is built around three fundamental concepts:
Basins
Namespaces that organize and configure streams
Streams
Ordered sequences of records with unique identifiers
Records
Individual data units with headers and body
How it works
The S2 architecture follows a clear data flow:
Write path
- Append: A client appends records to a stream within a basin
- Sequencing: Each record receives a unique sequence number and timestamp
- Durability: Records are written to object storage (using SlateDB in s2-lite)
- Acknowledgment: Only after durability is confirmed does the client receive an ack
- Broadcasting: Acknowledged records are broadcast to active followers
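The write path above can be sketched as a toy in-memory append loop. This is illustrative only: `ToyStream` is an assumed name, the list stands in for object storage, and a real implementation would persist the record before acknowledging:

```python
import time

class ToyStream:
    def __init__(self):
        self.log = []        # stands in for object storage
        self.followers = []  # callbacks for active readers
        self.next_seq = 0
        self.last_ts = 0

    def append(self, body: bytes) -> dict:
        # 1. Sequencing: assign a unique sequence number and a monotonic timestamp
        ts = max(int(time.time() * 1000), self.last_ts + 1)  # keep strictly increasing
        record = {"seq_num": self.next_seq, "timestamp": ts, "body": body}
        self.next_seq += 1
        self.last_ts = ts
        # 2. Durability: persist before acknowledging (here, just the in-memory log)
        self.log.append(record)
        # 3. Acknowledgment: only now does the caller learn the sequence number
        ack = {"seq_num": record["seq_num"], "timestamp": ts}
        # 4. Broadcasting: fan acknowledged records out to active followers
        for notify in self.followers:
            notify(record)
        return ack
```

The ordering of steps is the important part: the follower broadcast and the ack both happen strictly after the record is durable.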
Read path
- Client requests records from a stream starting at a position
- Historical data is read from object storage if needed
- Real-time data can be streamed via SSE or S2S protocols
- Followers receive broadcasts of new records as they’re appended
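The read path can be sketched the same way: serve history from the stored log first, then hand over to records received as broadcasts. Again a toy, with assumed names, that ignores details like deduplication at the handover point:

```python
class ToyReader:
    """Toy read path: historical records from the log, then live broadcasts."""

    def __init__(self, log):
        self.log = log   # records already persisted to storage
        self.live = []   # records broadcast after the subscription started

    def on_broadcast(self, record):
        # Called when the write path broadcasts a newly acknowledged record
        self.live.append(record)

    def read_from(self, start_seq):
        # Historical data first, read from storage if needed ...
        for rec in self.log:
            if rec["seq_num"] >= start_seq:
                yield rec
        # ... then real-time data received via the streaming protocols
        yield from self.live
```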
Key guarantees
S2 provides strong guarantees that make it suitable for critical data pipelines:
- Durability: All acknowledged writes are durable on object storage
- Ordering: Records maintain strict ordering within a stream via sequence numbers
- Exactly-once semantics: Achieved with fencing tokens and conditional writes
- Monotonic timestamps: Record timestamps are guaranteed to be monotonically increasing
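A hedged sketch of how fencing tokens and conditional writes can combine to give exactly-once appends: the token fences out stale ("zombie") writers, and matching on the expected sequence number makes a retried append fail instead of duplicating. The class and parameter names here are illustrative, not S2's actual API:

```python
class FencedStream:
    def __init__(self):
        self.records = []
        self.fencing_token = None

    def set_fencing_token(self, token: str):
        # Establish which writer currently holds the right to append
        self.fencing_token = token

    def append(self, body, token=None, match_seq_num=None):
        # Fencing: reject writers holding a stale token
        if self.fencing_token is not None and token != self.fencing_token:
            raise PermissionError("fencing token mismatch")
        # Conditional write: reject if the stream has moved past the
        # expected position (e.g. a retry of an already-applied append)
        if match_seq_num is not None and match_seq_num != len(self.records):
            raise ValueError("sequence mismatch: record may already be written")
        self.records.append(body)
        return len(self.records) - 1  # assigned sequence number
```

A client that retries an append with the same `match_seq_num` after a timeout either learns it already succeeded (sequence mismatch) or applies it exactly once.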
Storage architecture
S2-lite (the open-source implementation) uses SlateDB as its storage engine:
- Object storage backend: All data lives in S3-compatible object storage
- No local dependencies: Single binary with no external databases required
- In-memory mode: Can run entirely in memory for testing
- Configurable durability: Flush intervals and write options can be tuned
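The trade-off behind tunable flush intervals can be sketched as a buffered writer: appends accumulate in a batch, and the batch is flushed to storage either when full or when a timer fires (represented here by an explicit `flush()` call). This is a generic sketch of the technique, not s2-lite's actual configuration surface:

```python
class BufferedWriter:
    def __init__(self, storage, max_batch=8):
        self.storage = storage    # e.g. a list standing in for object storage
        self.max_batch = max_batch
        self.pending = []

    def write(self, record):
        self.pending.append(record)
        # Larger batches mean fewer, bigger object-storage writes
        # at the cost of higher latency before durability
        if len(self.pending) >= self.max_batch:
            self.flush()

    def flush(self):
        # Stand-in for the interval-driven flush: persist one object per batch
        if self.pending:
            self.storage.append(list(self.pending))
            self.pending.clear()
```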
Use cases
S2 is well-suited for:
- Event sourcing: Durable append-only logs for event-driven architectures
- Change data capture: Streaming database changes to downstream systems
- Analytics pipelines: Collecting and processing time-series data
- IoT data ingestion: High-throughput ingestion from distributed sensors
- Audit logs: Immutable, ordered logs for compliance and debugging
API overview
S2 provides multiple ways to interact with the service:
REST API
- /basins: Manage basins (create, list, delete)
- /streams: Manage streams (create, list, delete, check tail)
- /streams/{stream}/records: Append and read records
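As a rough illustration, an append request can be built against the records endpoint like this. The host, payload shape, and field names are assumptions for the sketch, not the official S2 request schema; only the path pattern and the base64 rule for binary bodies come from this page:

```python
import base64
import json

BASE = "https://example-s2-host"  # placeholder host, not a real endpoint

def append_request(stream: str, body: bytes, headers=None):
    """Build the URL and a JSON payload for POSTing a record to
    /streams/{stream}/records (payload shape is illustrative)."""
    url = f"{BASE}/streams/{stream}/records"
    payload = {
        "records": [{
            "headers": headers or {},
            # JSON format: binary bodies are base64-encoded
            "body": base64.b64encode(body).decode("ascii"),
        }]
    }
    return url, json.dumps(payload)
```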
Streaming protocols
- SSE (Server-Sent Events): Browser-compatible streaming reads
- S2S (S2 Streaming): High-performance bidirectional streaming with compression
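Because SSE is a standard browser-compatible wire format, consuming an S2 read over SSE mostly means parsing that format: events are separated by blank lines, and each `data:` line carries part of the payload. A minimal parser per the SSE format (what S2 puts inside each event is not assumed here):

```python
def parse_sse(raw: str):
    """Collect the data fields of each event in an SSE stream."""
    events = []
    data_lines = []
    for line in raw.splitlines():
        if line == "":
            # Blank line: dispatch the accumulated event
            if data_lines:
                events.append("\n".join(data_lines))
                data_lines = []
        elif line.startswith("data:"):
            value = line[5:]
            if value.startswith(" "):  # SSE strips one leading space
                value = value[1:]
            data_lines.append(value)
    if data_lines:  # trailing event without a final blank line
        events.append("\n".join(data_lines))
    return events
```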
Data formats
- JSON: Human-readable with base64 encoding for binary data
- Protobuf: Efficient binary encoding for production use
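The trade-off between the two formats is easy to see directly: base64 keeps binary data valid inside JSON text but inflates it by roughly a third, which is one reason a compact binary encoding like Protobuf is preferred in production:

```python
import base64
import json

body = bytes(range(256))  # arbitrary binary payload

# JSON format: binary bodies must be base64-encoded to be valid JSON text
encoded = base64.b64encode(body).decode("ascii")
as_json = json.dumps({"body": encoded})

# base64 expands the payload by ~33% (4 output bytes per 3 input bytes)
overhead = len(encoded) / len(body)

# The encoding round-trips losslessly
assert base64.b64decode(encoded) == body
```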
Next steps
Basins
Learn about organizing streams with basins
Streams
Understand how streams provide ordered sequences
Records
Explore the structure of individual records
Durability
Deep dive into durability guarantees