Overview

Meridian provides flexible data ingestion options to get your data into the platform quickly. You can upload CSV files directly or extract structured data from web pages using AI-powered extraction.

Upload Methods

The data upload interface provides two methods for importing data:

File Upload

Drag and drop CSV, XLSX, or XLS files (up to 10MB) directly into Meridian. Files are automatically:
  1. Uploaded to Cloudflare R2 storage
  2. Processed into DuckDB tables
  3. Made available for querying and analysis

URL Extraction

Extract structured data from any webpage using AI:
  1. Enter the URL of the webpage
  2. Provide a prompt describing what data to extract
  3. Meridian uses Firecrawl to scrape and extract structured data
  4. The extracted data is converted to a table automatically
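As a minimal sketch of the steps above, a helper can validate the URL and prompt before calling the extraction API (the names `ExtractionRequest` and `buildExtractionRequest` are illustrative, not part of Meridian's API):

```typescript
// Hypothetical sketch: validating inputs for a URL-extraction request.
interface ExtractionRequest {
  url: string
  prompt: string
}

function buildExtractionRequest(url: string, prompt: string): ExtractionRequest {
  // The scraper needs an http(s) URL
  if (!/^https?:\/\//.test(url)) {
    throw new Error('URL must start with http:// or https://')
  }
  // An empty prompt gives the extractor nothing to work with
  const trimmed = prompt.trim()
  if (trimmed.length === 0) {
    throw new Error('Prompt must describe the data to extract')
  }
  return { url, prompt: trimmed }
}
```

The returned object would then be passed to the extraction endpoint, which hands the URL to Firecrawl along with the prompt.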
A simplified version of the file-upload drop handler looks like this:

import { Dropzone, MIME_TYPES } from '@mantine/dropzone'
import { useUploadFile } from '@convex-dev/r2/react'

const handleDrop = async (acceptedFiles: File[]) => {
  for (const file of acceptedFiles) {
    // Upload to R2
    const storageId = await uploadFile(file)

    // Save metadata
    const fileId = await saveFile({
      storageId,
      fileName: file.name,
      fileType: file.type,
      fileSize: file.size,
    })

    // Create a DuckDB table for CSV files
    // (csvUrl is the public R2 URL of the upload; tableName is derived from the filename)
    if (file.type === 'text/csv') {
      const result = await createTableFromCSV({
        data: { csvUrl, tableName },
      })
    }
  }
}

Implementation Details

The upload flow is implemented in FileUpload.tsx and follows this architecture:

File Processing Pipeline

Step 1: Upload to Storage

Files are uploaded to Cloudflare R2 using the Convex R2 component:
const uploadFile = useUploadFile(api.r2)
const storageId = await uploadFile(file)

Step 2: Save Metadata

File metadata is saved to Convex database:
const fileId = await saveFile({
  storageId,
  fileName: file.name,
  fileType: file.type,
  fileSize: file.size,
})

Step 3: Create DuckDB Table

CSV files are processed into queryable DuckDB tables:
const result = await createTableFromCSV({
  data: { csvUrl, tableName }
})

Step 4: Link Table to File

The DuckDB table name is linked back to the file record:
await updateDuckDBInfo({
  fileId,
  tableName: result.tableName,
})

Usage Patterns

Basic CSV Upload

<FileUpload 
  onUploadComplete={() => {
    // Refresh table list
    // Navigate to new table
  }} 
/>

URL Extraction with Custom Prompt

For URL extraction, craft descriptive prompts:
Extract all product information including name, price, 
description, availability status, and customer ratings

Progress Tracking

The upload component provides real-time progress feedback:
{uploading && (
  <Box mt="md">
    <Text size="sm" mb="xs">
      Uploading... {uploadProgress}%
    </Text>
    <Progress value={uploadProgress} animated />
  </Box>
)}
Progress stages:
  • 20%: File uploaded to R2
  • 50-65%: Metadata saved
  • 85-95%: DuckDB table created
  • 100%: Complete
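These stages can be sketched as a simple mapping from pipeline step to reported percentage (the stage names here are illustrative, not taken from the component):

```typescript
// Illustrative sketch: the minimum progress value reported at each pipeline stage.
type UploadStage = 'uploaded' | 'metadata' | 'table' | 'done'

function progressFor(stage: UploadStage): number {
  switch (stage) {
    case 'uploaded':
      return 20 // file landed in R2
    case 'metadata':
      return 50 // metadata row saved
    case 'table':
      return 85 // DuckDB table created
    case 'done':
      return 100
  }
}
```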

Advanced Tips

Table names are automatically generated from filenames:
const tableName = file.name
  .replace(/\.csv$/i, '')
  .replace(/[^a-zA-Z0-9_]/g, '_')
  .toLowerCase()
Example: Sales Data 2024.csv → sales_data_2024
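Wrapped as a standalone function, the same transformation can be checked directly:

```typescript
// The filename-to-table-name sanitization shown above, as a testable helper.
function toTableName(fileName: string): string {
  return fileName
    .replace(/\.csv$/i, '')        // drop the .csv extension (case-insensitive)
    .replace(/[^a-zA-Z0-9_]/g, '_') // replace anything non-alphanumeric with _
    .toLowerCase()
}

console.log(toTableName('Sales Data 2024.csv')) // sales_data_2024
```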
Files up to 10MB are supported. For larger datasets:
  • Split into multiple files
  • Use URL extraction with pagination
  • Consider direct DuckDB import (see Architecture docs)
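For the first option, a minimal client-side sketch of splitting a CSV into smaller uploads might look like this (it assumes a header row and no embedded newlines inside quoted fields):

```typescript
// Sketch: split a CSV string into chunks, repeating the header in each chunk,
// so each piece stays under the upload size limit.
function splitCsv(text: string, maxRowsPerChunk: number): string[] {
  const [header, ...rows] = text.trim().split('\n')
  const chunks: string[] = []
  for (let i = 0; i < rows.length; i += maxRowsPerChunk) {
    chunks.push([header, ...rows.slice(i, i + maxRowsPerChunk)].join('\n'))
  }
  return chunks
}
```

Each resulting chunk can then be uploaded as its own file; since the table name is derived from the filename, give the chunks distinct names.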
If DuckDB table creation fails, the upload itself is preserved:

try {
  await createTableFromCSV({ data: { csvUrl, tableName } })
} catch (duckdbError) {
  // File is still uploaded and accessible
  // Table creation can be retried
  notifications.show({
    title: 'Warning',
    message: 'File uploaded but table creation failed',
    color: 'yellow',
  })
}

Common Use Cases

Uploading Sales Data

  1. Prepare your CSV with clean column headers
  2. Drag and drop the file into the upload zone
  3. Wait for processing (typically 5-15 seconds)
  4. Start querying immediately

Extracting Web Data

  1. Find a webpage with structured data
  2. Click the From URL tab
  3. Paste the URL
  4. Write a clear extraction prompt
  5. Click Extract Data & Create Table

Batch Uploads

You can upload multiple files at once:
<Dropzone
  onDrop={handleDrop}
  maxSize={10 * 1024 ** 2}
  accept={[MIME_TYPES.csv, MIME_TYPES.xlsx, MIME_TYPES.xls]}
>
  {/* Multiple files are processed sequentially */}
</Dropzone>
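The sequential processing noted in the comment can be sketched as a small helper (illustrative, not the component's actual code): awaiting each file before starting the next keeps the progress bar meaningful and avoids overlapping writes.

```typescript
// Sketch: process dropped files one at a time rather than in parallel.
async function processSequentially<T>(
  items: T[],
  handler: (item: T) => Promise<void>,
): Promise<void> {
  for (const item of items) {
    await handler(item) // awaiting here serializes the uploads
  }
}
```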

API Integration

The upload feature integrates with these Convex APIs:
  • api.r2 - R2 storage operations
  • api.csv.saveFile - Save file metadata
  • api.csv.updateDuckDBInfo - Link table to file
  • api.csv.createTableFromURL - URL extraction
For more details, see the API Reference.

Next Steps

Query Your Data

Learn how to write SQL queries against your uploaded data

AI Agents

Use AI agents to analyze and query your data automatically
