Data Sources Overview

Universal SQL Architecture

Evidence uses a Universal SQL architecture that allows you to connect to multiple data sources and query them using standard SQL. Each data source connector implements a common interface that:

Executes SQL queries against your data warehouse or database
Streams results efficiently in batches
Maps native data types to Evidence’s type system (string, number, date, boolean)
Handles authentication and connection management

How Data Sources Work

When you connect a data source to Evidence:

Configuration: Define connection parameters in your connection.yaml file
Authentication: Each connector handles authentication (service accounts, tokens, passwords, etc.)
Query Execution: SQL files in your project are executed against the connected data source
Type Mapping: Native database types are mapped to Evidence types for consistent rendering
Streaming: Results are streamed in batches for memory-efficient processing

Supported Data Sources

Evidence supports a wide range of data warehouses, databases, and file formats:

BigQuery

Google Cloud’s data warehouse

Snowflake

Cloud data platform

PostgreSQL

Popular open-source database

Redshift

Amazon’s data warehouse

DuckDB

Embedded analytical database

MotherDuck

Cloud-native DuckDB

MySQL

Popular relational database

SQL Server

Microsoft SQL Server

SQLite

Embedded database

Databricks

Lakehouse platform

Trino

Distributed SQL query engine

CSV Files

CSV and flat files

Connection Configuration

All data sources are configured using a connection.yaml file in your Evidence project:

name: my_database
type: postgres
options:
  host: localhost
  port: 5432
  database: mydb
  user: username
  password: ${POSTGRES_PASSWORD}

Use environment variables (with ${VAR_NAME} syntax) to keep sensitive credentials out of your code.

Type System

Evidence maps native database types to four core types:

string: Text, JSON, XML, time-only values
number: Integers, decimals, floats
date: Dates, timestamps, datetimes
boolean: True/false values

This consistent type system ensures that components and visualizations work the same way regardless of which data source you’re using.

Performance Considerations

Batch Size

Connectors stream results in batches (default: 100,000 rows) to balance memory usage and performance. Large result sets are processed incrementally.

Connection Pooling

Most connectors use connection pooling to efficiently manage database connections and improve query performance.

Row Count Estimation

When possible, connectors provide expected row counts to help Evidence optimize rendering and pagination.

Next Steps

Choose your data source from the list above to see specific installation and configuration instructions.

Get Started

Core Concepts

Data Sources

Components

Deployment

Guides

Data Sources Overview

Universal SQL Architecture

How Data Sources Work

Supported Data Sources

BigQuery

Snowflake

PostgreSQL

Redshift

DuckDB

MotherDuck

MySQL

SQL Server

SQLite

Databricks

Trino

CSV Files

Connection Configuration

Type System

Performance Considerations

Batch Size

Connection Pooling

Row Count Estimation

Next Steps

Build docs developers (and LLMs) love

Get Started

Core Concepts

Data Sources

Components

Deployment

Guides

​Universal SQL Architecture

​How Data Sources Work

​Supported Data Sources

BigQuery

Snowflake

PostgreSQL

Redshift

DuckDB

MotherDuck

MySQL

SQL Server

SQLite

Databricks

Trino

CSV Files

​Connection Configuration

​Type System

​Performance Considerations

​Batch Size

​Connection Pooling

​Row Count Estimation

​Next Steps

Build docs developers (and LLMs) love

Universal SQL Architecture

How Data Sources Work

Supported Data Sources

Connection Configuration

Type System

Performance Considerations

Batch Size

Connection Pooling

Row Count Estimation

Next Steps