
Data Sources

Evidence connects to databases and extracts data into a common storage format (Parquet) to enable querying across multiple data sources using SQL.

How Data Sources Work

Evidence uses a Universal SQL architecture that:
  1. Extracts data from your source databases into Parquet files
  2. Enables querying across multiple data sources using DuckDB SQL dialect
  3. Stores extracted data locally for fast query performance
This architecture allows you to query data from PostgreSQL, Snowflake, BigQuery, and other sources in a single SQL query.
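Because every source is extracted into the same local Parquet store, a single DuckDB query can join across them. A sketch, using hypothetical source names `pg_crm` and `snowflake_sales`:

```sql
-- Both sources were already extracted to Parquet, so DuckDB can join them
-- even though they originate in different databases (names are illustrative).
SELECT
    c.customer_name,
    SUM(o.amount) AS lifetime_value
FROM pg_crm.customers c
JOIN snowflake_sales.orders o ON o.customer_id = c.customer_id
GROUP BY c.customer_name
```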

Connecting a Data Source

To connect your development environment to a database:

Using the Settings UI

  1. Start your Evidence app:
    npm run dev
    
  2. Navigate to localhost:3000/settings
  3. Select your data source type, name it, and enter required credentials
  4. Test the connection to verify it works
Evidence saves your credentials locally in your development environment. Production credentials are managed via environment variables.

Supported Data Sources

Evidence supports a wide range of data sources.

SQL Databases:
  • BigQuery
  • Snowflake
  • Redshift
  • PostgreSQL / Timescale
  • Microsoft SQL Server
  • MySQL
  • SQLite
  • DuckDB
  • MotherDuck
  • Databricks
  • Trino
  • Cube
File-Based Sources:
  • CSV files
  • Google Sheets
Custom Sources:
  • JavaScript data sources
  • API connectors via plugins
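As a sketch of a file-based source (directory and file names here are illustrative), a CSV dropped into a source directory becomes queryable like any other table:

```
sources/
`-- my_csvs/
    |-- connection.yaml   # declares the source type (csv)
    `-- customers.csv
```

The file would then be queried in a page as `my_csvs.customers`.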

Configuring Source Queries

For SQL data sources, you define which data to extract by adding .sql files to the /sources/[source_name]/ directory.
sources/
`-- my_database/
    |-- connection.yaml
    |-- customers.sql
    `-- orders.sql
Source queries use your data source’s native SQL dialect (e.g., BigQuery SQL, Snowflake SQL), not DuckDB SQL.
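The connection.yaml file identifies the source. A minimal sketch for a PostgreSQL source (exact option names vary by connector, so check your connector's documentation):

```yaml
name: my_database
type: postgres
```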

Example Source Query

Create a file sources/my_database/orders.sql:
SELECT 
    order_id,
    customer_id,
    order_date,
    total_amount,
    status
FROM orders
WHERE order_date >= CURRENT_DATE - INTERVAL '90 days'
This creates a table accessible in your Evidence pages as my_database.orders.

Running Sources

Extract data from your configured sources:
npm run sources
With the dev server running, sources automatically re-run when you change source queries.

Running Specific Sources

For large data sources, you can run only what you need:
# Run only changed sources
npm run sources -- --changed

# Run a specific source
npm run sources -- --sources my_database

# Run specific queries from a source
npm run sources -- --sources my_database --queries orders,customers

Using Extracted Data

Once extracted, query your data in Evidence pages using the DuckDB SQL dialect:
```sql recent_orders
SELECT 
    order_date,
    COUNT(*) as order_count,
    SUM(total_amount) as total_sales
FROM my_database.orders
GROUP BY order_date
ORDER BY order_date DESC
```

<LineChart 
    data={recent_orders}
    x="order_date"
    y="total_sales"
/>

Build-Time Variables

You can parameterize source queries using environment variables with the EVIDENCE_VAR__ prefix. For example, in a .env file:
EVIDENCE_VAR__min_date=2024-01-01
EVIDENCE_VAR__region=us-west
In your source query:
SELECT *
FROM sales
WHERE date >= '${min_date}'
  AND region = '${region}'
Build-time variables are only available in source queries, not in page queries or markdown files.

Working with Large Data

If you encounter memory errors when running sources:
FATAL ERROR: Reached heap limit Allocation failed - JavaScript heap out of memory
Increase the memory allocation: macOS / Linux:
NODE_OPTIONS="--max-old-space-size=4096" npm run sources
Windows:
set NODE_OPTIONS=--max-old-space-size=4096 && npm run sources

Production Deployment

In production, credentials are managed via environment variables. Each data source has specific environment variables for credentials.
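As an illustration, source credentials typically follow a naming scheme of the form `EVIDENCE_SOURCE__[source]__[option]` (the option names depend on the connector; the values below are placeholders):

```
EVIDENCE_SOURCE__my_database__host=db.example.com
EVIDENCE_SOURCE__my_database__user=readonly
EVIDENCE_SOURCE__my_database__password=********
```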
See the deployment configuration documentation for details on setting up production credentials.

Need Help?

If you need a data source that isn’t currently supported, you can build your own connector: the source code for Evidence’s existing connectors is available on GitHub as a starting point.
