Skip to main content
Data science projects accumulate a large number of choices — which dataset to use, which algorithm to try, why a certain approach was accepted or rejected. Without a structured record, those decisions become invisible and your work becomes hard to reproduce or audit. Babel’s traceability panel lets you log three types of records directly within each CRISP-DM phase: datasets, experiments, and decisions. Every record is linked to a phase, so you always know when in the lifecycle something happened.

Accessing the traceability panel

Expand any phase card on the project pipeline page. Below the Kanban board you will find the Traceability panel, with tabs for Datasets, Experiments, and Decisions. Records displayed are automatically filtered to the current phase.

Phase filter

Each record type supports a ?phase= query filter. In the UI, the traceability panel automatically applies the filter for the expanded phase, so you only see records relevant to that part of the lifecycle. You can switch to a different phase using the phase selector at the top of the panel if you need to review records across phases.

Record types

A dataset record documents a data source that was ingested or used during this phase. Log it as soon as you acquire a new dataset so the team knows its origin, size, and acquisition context.To add a dataset: open the traceability panel, select the Datasets tab, and click Add dataset.
name
string
required
A short, recognizable name for the dataset. For example: Customer transactions Q1 2024.
source
string
required
Where the dataset came from — a database name, file path, external URL, or API endpoint. This is the primary reference for reproducibility.
description
string
required
A plain-text description of what the dataset contains, its domain, any known quality issues, and how it relates to the project objectives.
records
number
required
The number of rows or records in the dataset at the time of acquisition. Useful for tracking if the dataset changes over time.
acquisitionDate
string
required
The date the dataset was obtained, in ISO-8601 format (YYYY-MM-DD). Used to establish a timeline in the audit trail.
phase
TaskPhase
required
The CRISP-DM phase during which this dataset was acquired. Accepted values: business, data_understanding, data_preparation, modeling, evaluation, deployment.
Record every version of a dataset separately if it changes significantly between phases (for example, the raw extract vs the cleaned version after ETL).

Deleting a traceability record

Each record card has a Delete button (trash icon). Deletion is immediate and permanent after confirmation.
Deleting a traceability record removes it from the project report as well. If the record has already been included in a shared PDF report, the report is unaffected — but future report exports will not include the deleted record.

Traceability in the project report

When you export a PDF report from the project dashboard, all datasets, experiments, and decisions are included in dedicated sections, organized by phase. This makes the traceability log directly usable as an appendix to a client deliverable or internal review document.

Build docs developers (and LLMs) love