This guide provides an overview of querying Snuba, including how to send queries, understand responses, and work with both SnQL and MQL query languages.

Query Languages

Snuba supports two query languages:
  • SnQL (Snuba Query Language): A SQL-like language for querying events, transactions, and other entities
  • MQL (Metrics Query Language): A specialized language for querying metrics and timeseries data
Both languages can be sent to Snuba via HTTP endpoints and return structured JSON responses.

Endpoints

Snuba provides dataset-specific endpoints for executing queries:

SnQL Endpoint

POST /:dataset/snql
For example:
  • /discover/snql - Query the discover dataset
  • /events/snql - Query the events dataset
  • /transactions/snql - Query the transactions dataset

MQL Endpoint

POST /:dataset/mql
For example:
  • /generic_metrics/mql - Query metrics data

Request Format

SnQL Request

SnQL queries are sent as JSON payloads with the following structure:
{
  "query": "MATCH (events) SELECT event_id, project_id WHERE timestamp >= toDateTime('2024-01-01')",
  "dataset": "discover",
  "debug": false,
  "consistent": false,
  "turbo": false,
  "tenant_ids": {
    "organization_id": 123,
    "referrer": "my_app"
  }
}
Request fields:
  • query (string, required): The SnQL query string to execute
  • dataset (string): The dataset name; optional if specified in the URL
  • debug (boolean, default false): When true, returns additional debugging information, including the generated SQL query and detailed statistics
  • consistent (boolean, default false): Forces single-threaded execution and routes the query to the same node for sequential consistency
  • turbo (boolean, default false): Applies sampling to speed up query execution at the cost of accuracy
  • tenant_ids (object, required): Identification information for the query:
      • organization_id: Organization identifier
      • referrer: String identifying the source of the query
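
Putting the fields together, a minimal sketch of sending a SnQL request with only the standard library might look like this (the host and port assume a local Snuba instance; adjust for your deployment):

```python
import json
import urllib.request

# Build the SnQL request payload described above.
payload = {
    "query": (
        "MATCH (events) SELECT event_id, project_id "
        "WHERE timestamp >= toDateTime('2024-01-01')"
    ),
    "debug": False,
    "tenant_ids": {"organization_id": 123, "referrer": "my_app"},
}

req = urllib.request.Request(
    "http://127.0.0.1:1218/discover/snql",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)
# Sending requires a running Snuba instance:
# result = json.load(urllib.request.urlopen(req))
```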

MQL Request

MQL queries use a different structure with a query string and context object:
{
  "mql": "sum(transaction.duration){environment:prod} by status_code",
  "mql_context": {
    "entity": "generic_metrics_distributions",
    "start": "2023-01-02T03:04:05+00:00",
    "end": "2023-01-16T03:04:05+00:00",
    "rollup": {
      "granularity": 3600,
      "interval": 3600,
      "with_totals": null
    },
    "scope": {
      "org_ids": [1],
      "project_ids": [11],
      "use_case_id": "transactions"
    },
    "limit": 100,
    "offset": 0
  }
}
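
A small helper can keep the nested mql_context structure consistent. This is a hypothetical convenience function built from the fields shown above, not part of any Snuba client library:

```python
# Hypothetical helper that assembles an MQL request body; field names
# mirror the example above.
def make_mql_request(mql, entity, start, end, org_ids, project_ids,
                     use_case_id, granularity=3600, interval=3600,
                     limit=100, offset=0):
    return {
        "mql": mql,
        "mql_context": {
            "entity": entity,
            "start": start,
            "end": end,
            "rollup": {
                "granularity": granularity,
                "interval": interval,
                "with_totals": None,
            },
            "scope": {
                "org_ids": org_ids,
                "project_ids": project_ids,
                "use_case_id": use_case_id,
            },
            "limit": limit,
            "offset": offset,
        },
    }

body = make_mql_request(
    "sum(transaction.duration){environment:prod} by status_code",
    entity="generic_metrics_distributions",
    start="2023-01-02T03:04:05+00:00",
    end="2023-01-16T03:04:05+00:00",
    org_ids=[1],
    project_ids=[11],
    use_case_id="transactions",
)
```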

Response Format

Successful Response

A successful query returns a JSON response with the following structure:
{
  "data": [
    {
      "event_id": "abc123",
      "project_id": 1,
      "timestamp": "2024-01-01T12:00:00"
    },
    {
      "event_id": "def456",
      "project_id": 1,
      "timestamp": "2024-01-01T12:01:00"
    }
  ],
  "meta": [
    {
      "name": "event_id",
      "type": "UUID"
    },
    {
      "name": "project_id",
      "type": "UInt64"
    },
    {
      "name": "timestamp",
      "type": "DateTime"
    }
  ],
  "timing": {
    "timestamp": 1621038379,
    "duration_ms": 95,
    "marks_ms": {
      "cache_get": 1,
      "cache_set": 4,
      "execute": 39,
      "prepare_query": 10,
      "rate_limit": 4,
      "validate_schema": 34
    }
  },
  "stats": {
    "clickhouse_table": "events_local",
    "final": false,
    "sample": null,
    "consistent": false,
    "result_rows": 2,
    "result_cols": 3,
    "query_id": "f09f3f9e1c632f395792c6a4bfe7c4fe"
  }
}

Response Fields

  • data (array): Result rows; each row is an object with column names as keys
  • meta (array): Schema information for each column in the result set:
      • name: Column name
      • type: ClickHouse data type
  • timing (object): Performance metrics:
      • timestamp: Unix timestamp when the query was executed
      • duration_ms: Total query duration in milliseconds
      • marks_ms: Breakdown of time spent in each processing phase
  • stats (object): Query execution statistics:
      • clickhouse_table: The table that was queried
      • final: Whether FINAL was applied to the query
      • sample: Sampling rate, if applied
      • consistent: Whether consistent mode was used
      • result_rows: Number of rows returned
      • result_cols: Number of columns returned
      • query_id: Unique identifier for this query
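
A typical consumer pairs each column's declared type from meta with the values in data. A minimal sketch, using a truncated copy of the example response from this page:

```python
# Truncated version of the successful-response example above.
response = {
    "data": [
        {"event_id": "abc123", "project_id": 1,
         "timestamp": "2024-01-01T12:00:00"},
    ],
    "meta": [
        {"name": "event_id", "type": "UUID"},
        {"name": "project_id", "type": "UInt64"},
        {"name": "timestamp", "type": "DateTime"},
    ],
}

# Map column name -> ClickHouse type for quick lookups.
column_types = {col["name"]: col["type"] for col in response["meta"]}

for row in response["data"]:
    for name, value in row.items():
        print(f"{name} ({column_types[name]}): {value}")
```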

Debug Mode Response

When debug: true is set in the request, the response includes an additional sql field:
{
  "data": [...],
  "meta": [...],
  "timing": {...},
  "stats": {...},
  "sql": "SELECT event_id, project_id FROM events_local WHERE ..."
}
The sql field contains the actual ClickHouse SQL query that was generated and executed.

Error Response

Error responses follow this format:
{
  "error": {
    "type": "invalid_query",
    "message": "Missing >= condition with a datetime literal on column timestamp"
  }
}
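
Since errors arrive in the response body rather than as a distinct payload shape, callers usually check for the error key before reading data. A minimal sketch (the function name is illustrative, not a Snuba API):

```python
# Raise if the body carries an "error" object, otherwise return the rows.
def rows_or_raise(body):
    if "error" in body:
        err = body["error"]
        raise RuntimeError(f"{err['type']}: {err['message']}")
    return body["data"]
```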

HTTP Status Codes

200: Query executed successfully

Using the Web UI

Snuba provides a minimal web UI for testing queries. When running Snuba locally, access it at:
http://127.0.0.1:1218/:dataset/snql
The UI provides a simple form where you can:
  • Enter your SnQL query
  • Set query options (debug, consistent, turbo)
  • View the response in formatted JSON

Using curl

You can send queries using curl or any HTTP client:
curl -X POST http://127.0.0.1:1218/discover/snql \
  -H "Content-Type: application/json" \
  -d '{
    "query": "MATCH (events) SELECT count() AS total WHERE project_id = 1 AND timestamp >= toDateTime('"'"'2024-01-01'"'"') AND timestamp < toDateTime('"'"'2024-01-02'"'"') LIMIT 1000",
    "tenant_ids": {
      "organization_id": 1,
      "referrer": "test"
    },
    "debug": true
  }'

Using the Python SDK

The recommended way to build queries programmatically is using the Snuba SDK:
from datetime import datetime
from snuba_sdk import (
    Column,
    Condition,
    Entity,
    Function,
    Limit,
    Op,
    Query,
)

query = Query(
    dataset="discover",
    match=Entity("events"),
    select=[
        Column("title"),
        Function("uniq", [Column("event_id")], "uniq_events"),
    ],
    groupby=[Column("title")],
    where=[
        Condition(Column("timestamp"), Op.GT, datetime(2021, 1, 1)),
        Condition(Column("project_id"), Op.IN, Function("tuple", [1, 2, 3])),
    ],
    limit=Limit(10),
)

Query Processing Pipeline

Snuba processes queries through multiple stages:
  1. Parsing: SnQL/MQL string is parsed into an AST
  2. Validation: Query is validated against entity requirements
  3. Logical Processing: Entity-specific transformations are applied
  4. Storage Selection: Optimal storage/table is selected
  5. Translation: Logical query is translated to physical ClickHouse query
  6. Physical Processing: Optimizations are applied
  7. Execution: Query is executed on ClickHouse
This pipeline ensures queries are valid, optimized, and safe to execute.
Some entities require specific conditions (e.g., project_id and timestamp range) to prevent expensive queries. Check entity documentation for requirements.
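
The stages above can be pictured as a chain of functions, each transforming one query representation into the next. This is a conceptual sketch only, not Snuba's actual internals:

```python
# Conceptual sketch of the pipeline; stage bodies are stubs.
def parse(snql: str) -> dict:           # 1. SnQL string -> AST
    return {"ast": snql}

def validate(q: dict) -> dict:          # 2. Check entity requirements
    if "timestamp" not in q["ast"]:
        raise ValueError("Missing required timestamp condition")
    return q

def to_clickhouse_sql(q: dict) -> str:  # 5. Logical -> physical query
    return f"-- translated from: {q['ast']}"

# Stages 3, 4, 6 (logical processing, storage selection, physical
# processing) are rewrites of the same shape and are omitted here.
def run(snql: str) -> str:
    return to_clickhouse_sql(validate(parse(snql)))
```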

Best Practices

  • Always include time bounds: Most entities require timestamp conditions to prevent full table scans
  • Use appropriate limits: Default limit is 1000 rows if not specified
  • Enable debug mode during development: Helps understand query performance and optimization
  • Use consistent mode sparingly: It reduces concurrency and may impact performance
  • Leverage the SDK: Building queries with the SDK prevents syntax errors and provides type safety
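
Put together, a request body that follows these practices might look like the following (all values are illustrative):

```python
# Illustrative payload: bounded timestamps, an explicit LIMIT, and
# identifying tenant_ids. Values are examples only.
good_request = {
    "query": (
        "MATCH (events) "
        "SELECT count() AS total "
        "WHERE project_id = 1 "
        "AND timestamp >= toDateTime('2024-01-01') "
        "AND timestamp < toDateTime('2024-01-02') "
        "LIMIT 100"
    ),
    "debug": True,       # enable during development only
    "consistent": False, # avoid unless sequential consistency is needed
    "tenant_ids": {"organization_id": 1, "referrer": "my_service"},
}
```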

Next Steps

SnQL Syntax

Learn the complete SnQL syntax and available functions

MQL Syntax

Explore MQL for metrics and timeseries queries

Optimization

Optimize your queries for better performance
