Understanding the workflow
Working with datasets follows a natural progression:Searching for datasets
Thesearch_datasets tool is your starting point for discovering data on data.gouv.fr.
Query optimization
The data.gouv.fr API uses strict AND logic for searches, meaning all query terms must match. To improve results, the server automatically removes common stop words that rarely appear in dataset metadata:- Generic terms: “données”, “donnee”, “fichier”, “tableau”
- Format names: “csv”, “excel”, “xlsx”, “json”, “xml”
- Use specific, descriptive terms
- Avoid generic words like “data” or “file”
- Use domain-specific keywords (e.g., “immobilier”, “population”, “transport”)
Search parameters
Example searches
Getting dataset information
Once you’ve found a dataset, useget_dataset_info to retrieve complete metadata.
Returned information
- Title and description (short and full)
- Dataset ID and slug
- Organization details
- Tags and keywords
- Number of resources
- Creation and update dates
- License information
- Update frequency
Listing resources
Datasets contain one or more resources (files). Uselist_dataset_resources to see all available files.
Resource metadata
For each resource, you’ll receive:- Resource ID (needed for data queries)
- Title and description
- File format (CSV, JSON, XLSX, etc.)
- File size (formatted in B, KB, MB, or GB)
- MIME type
- Resource type
- Download URL
File sizes are automatically formatted for readability:
- Less than 1 KB: shown in bytes
- Less than 1 MB: shown in kilobytes
- Less than 1 GB: shown in megabytes
- 1 GB or more: shown in gigabytes
Checking resource details
Before querying data, useget_resource_info to understand the resource’s characteristics and determine the best access method.
Tabular API availability
The tool checks whether a resource is available via the Tabular API by:- Checking if the resource is in the exceptions list (large files with special support)
- Attempting to fetch the resource profile
- Reporting availability status with indicators:
- ✅ Available via Tabular API (can be queried)
- ✅ Available via Tabular API (large file exception)
- ⚠️ Not available via Tabular API (may not be tabular data)
Resource information includes
- Format and MIME type
- File size
- Download URL
- Description
- Associated dataset details
- Tabular API compatibility
Common workflows
Finding and exploring a dataset
Working with large datasets
For datasets with many resources or large files:- Start with
list_dataset_resourcesto see all files - Use
get_resource_infoto check each file’s size and format - For CSV/XLSX files under API limits, use
query_resource_data - For larger files or other formats, use
download_and_parse_resource
Next steps
Querying data
Learn how to query and download resource data
Usage metrics
Check dataset and resource usage statistics