
Welcome to RaceData

RaceData is a comprehensive Formula 1 dataset archive maintained by TracingInsights. This dataset contains detailed race data spanning from 1950 through the current season, with automated updates deployed within 3 hours of each race finish.

What is RaceData?

RaceData provides a complete, structured collection of Formula 1 data sourced from Kaggle datasets and released under the CC0: Public Domain license. The data is automatically synchronized and made available through multiple distribution channels including GitHub releases and HuggingFace Datasets.

Key Features

Historical Coverage

Complete race data from 1950 to present, covering over 70 years of Formula 1 history

Automated Updates

Data refreshed automatically within 3 hours of race completion via GitHub Actions

18 Data Tables

Comprehensive schema including circuits, drivers, constructors, lap times, pit stops, and more

Multiple Access Methods

Available via GitHub releases, HuggingFace Datasets, and direct download

What’s Included

The dataset consists of 18 CSV tables covering all aspects of Formula 1 racing:
  • Core Data: Races, drivers, constructors, circuits, seasons
  • Performance Metrics: Lap times, qualifying results, race results, sprint results
  • Standings: Driver standings, constructor standings
  • Events: Pit stops, safety cars, red flags, virtual safety car estimates
  • Historical Records: Fatal accidents (drivers and marshals), status codes
Each table is carefully structured with relational keys, allowing you to join data across multiple dimensions for sophisticated analysis.
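As a rough illustration of what joining on relational keys looks like, here is a minimal sketch using only the Python standard library. The three CSV snippets are hand-written stand-ins rather than rows taken from the dataset, and the column names (raceId, driverId, positionOrder, year, name, surname) are assumptions made for this example; check the Data Schema page for the actual names.

```python
import csv
import io

# Illustrative stand-ins for three of the CSV tables. The column names
# (raceId, driverId, positionOrder, ...) are assumptions, not the documented schema.
RACES_CSV = """raceId,year,name
1,2023,Bahrain Grand Prix
2,2023,Saudi Arabian Grand Prix
"""
DRIVERS_CSV = """driverId,surname
10,Verstappen
11,Perez
"""
RESULTS_CSV = """raceId,driverId,positionOrder
1,10,1
1,11,2
2,10,2
2,11,1
"""


def read_rows(text):
    """Parse CSV text into a list of dicts, one per row (all values are strings)."""
    return list(csv.DictReader(io.StringIO(text)))


def index_by(rows, key):
    """Build a lookup from a key column to its row (assumes the key is unique)."""
    return {row[key]: row for row in rows}


def join_results(results, races, drivers):
    """Attach race and driver columns to each result row via the shared keys."""
    races_by_id = index_by(races, "raceId")
    drivers_by_id = index_by(drivers, "driverId")
    joined = []
    for result in results:
        row = dict(result)
        row.update(races_by_id[result["raceId"]])
        row.update(drivers_by_id[result["driverId"]])
        joined.append(row)
    return joined


joined = join_results(
    read_rows(RESULTS_CSV), read_rows(RACES_CSV), read_rows(DRIVERS_CSV)
)

# Keep only first-place finishes; values are compared as strings because
# csv.DictReader does not convert types.
winners = [
    (r["year"], r["name"], r["surname"])
    for r in joined
    if r["positionOrder"] == "1"
]
for year, name, surname in winners:
    print(year, name, surname)
```

With real files, the same pattern applies after opening each CSV with `csv.DictReader`. If you work in pandas, `DataFrame.merge` on the shared key column is the usual equivalent.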

Who Should Use RaceData?

Perfect for building predictive models, conducting statistical analysis, and exploring performance trends across teams and drivers.
Ideal for academic research, thesis work, and sports analytics studies with a comprehensive, well-documented dataset.
Build applications, dashboards, and tools with structured, machine-readable F1 data, available through GitHub releases, HuggingFace Datasets, and direct download.
Explore historical trends, create visualizations, and dive deep into the statistics behind your favorite sport.

License & Usage

RaceData is released under the CC0 1.0 Universal (Public Domain) license. You can use this data freely for:
  • Data analysis and research
  • Visualization projects
  • Academic work
  • Personal and commercial applications
Attribution to TracingInsights is appreciated but not required.
TracingInsights and this dataset are unofficial and are not associated with Formula 1 companies. F1, FORMULA ONE, FORMULA 1, FIA FORMULA ONE WORLD CHAMPIONSHIP, GRAND PRIX and related marks are trademarks of Formula One Licensing B.V.

Next Steps

Quickstart

Get started with downloading and using the data in minutes

Data Schema

Explore the complete data dictionary and table relationships

Analysis Guides

See practical examples of data analysis and visualization

Data Access

Learn how to download and access the data

Support the Project

RaceData is maintained by TracingInsights. You can support the project by:

Data Sources & Credits

This dataset aggregates data from Kaggle datasets. Special thanks to the original data maintainers and the Formula 1 community.
