Artifact size limit
The submission artifact is computed as code bytes + compressed model bytes. The cap is decimal 16 MB:The limit is 16,000,000 decimal bytes, not 16 MiB (16,777,216 bytes). Compressed model size is measured as the int8 + zlib roundtrip output of your model weights.
Required files
Every submission folder must contain the following four files:README.md
Explains the submission in reasonable detail — architecture, training config, key metrics, and the command used to reproduce the run.
submission.json
Structured metadata including your name, GitHub ID,
val_bpb, and byte counts. See format below.train.log
The exact training log automatically produced by your training script. Do not edit or truncate it.
train_gpt.py
The training script snapshot used for the run. It must compile and run successfully from within the records folder.
Folder structure and naming
Submissions live under one of two tracks:submission.json format
Thesubmission.json file provides structured metadata for the leaderboard. All fields are required unless noted.
val_bpb is the post-quantization roundtrip bits-per-byte score, printed as final_int8_zlib_roundtrip_exact val_bpb in your train log. This is the official score used for the leaderboard.Evaluation constraints
The following constraints apply to all submissions during evaluation:- No external downloads or network calls during evaluation
- No access to training data during evaluation (unless those bits are counted within the 16 MB limit)
- No data from outside the allowed FineWeb validation set
Submitting
Submit your entry as a pull request that adds only your new folder to the appropriate/records subfolder. The PR should not modify any other files in the repository.
