Uploading CSV Files
The most straightforward way to add data to Meridian is by uploading CSV files.Navigate to Your Workspace
Open Meridian and ensure you’re logged in. Your uploaded files will be associated with your user account.
Select Your CSV File
Click the upload button and select a CSV file from your computer. Meridian supports standard CSV format with headers.Supported formats:
.csvfiles with comma-separated values- UTF-8 encoded files
- Files with header rows (column names)
Upload and Process
Once uploaded, Meridian automatically:
- Stores your file securely in Cloudflare R2
- Creates a DuckDB table for fast querying
- Generates a unique table name
- Indexes the data for optimal performance
Extracting Data from URLs
Meridian can extract structured data from web pages using Firecrawl, making it easy to analyze online data without manual copying.Enter the URL
Instead of uploading a file, paste the URL of the webpage containing the data you want to analyze.
Describe What You Want
Provide a natural language prompt describing the data you want to extract:Example prompts:
- “Extract all job listings with title, company, and location”
- “Get product names and prices from this page”
- “List all articles with title, author, and publication date”
AI-Powered Extraction
Meridian uses Firecrawl to:
- Scrape the webpage content
- Apply AI to understand the structure
- Extract data matching your description
- Convert it to structured JSON format
File Management
Viewing Uploaded Files
All your uploaded files and extracted datasets are tracked in Meridian. Each file record includes:- File name - Original name or generated name for URL extracts
- File type - CSV, JSON, or other supported formats
- File size - Size in bytes
- Upload date - When the file was added
- DuckDB table name - Internal table identifier for queries
- Processing status - Whether DuckDB indexing is complete
Accessing File Data
You can query your uploaded data immediately after processing:Deleting Files
When you delete a file:- The file is removed from storage (R2)
- The database record is deleted
- The DuckDB table remains until you explicitly drop it
Data Storage Architecture
Meridian uses a hybrid storage approach for optimal performance:Cloudflare R2
Raw file storage for CSV and JSON files. Provides durable, cost-effective object storage.
DuckDB
Columnar analytical database for fast querying. Handles millions of rows with sub-second response times.
Convex
Real-time metadata storage. Tracks file information and enables live collaboration.
TanStack Start
Server-side processing. Runs DuckDB natively (not WASM) for maximum performance.
Best Practices
Prepare Your CSV Files
Prepare Your CSV Files
- Include a header row with descriptive column names
- Use consistent data types within each column
- Avoid special characters in column names (use underscores instead of spaces)
- Keep file sizes reasonable (under 100MB for optimal performance)
- Use UTF-8 encoding to prevent character issues
URL Extraction Tips
URL Extraction Tips
- Be specific in your extraction prompts
- Test with a single page before extracting from multiple URLs
- Check the data quality after extraction
- URLs with dynamic content (JavaScript-rendered) work well with Firecrawl
- Include expected column names in your prompt for better results
Naming Conventions
Naming Conventions
- Use descriptive file names that indicate the data contents
- For URL extracts, provide a custom table name if needed
- Table names are sanitized to remove special characters
- Avoid duplicate table names to prevent conflicts
Troubleshooting
What’s Next?
Now that your data is uploaded, you can:Query with SQL
Write powerful SQL queries to transform and analyze your data
Use AI Agents
Let AI agents help you explore and understand your data