Skip to content

sportsdataverse/nfl-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nfl-data

Builds NFL compiled play-by-play datasets and trains EP / WP / CP models from the raw NFL game JSON committed in sportsdataverse/nfl-raw, then publishes datasets + model artifacts to GitHub Releases on sportsdataverse/sportsdataverse-data.

Sibling of nfl-raw in the SportsDataverse -raw-data split: nfl-raw scrapes and commits raw JSON; nfl-data is the consumer that reshapes (with nflfastR parity), models, reports, and publishes. See docs/raw-to-data-migration-playbook.md and docs/superpowers/specs/2026-06-17-nfl-raw-to-data-migration-design.md.

Layout

  • python/ — uv project. native_pbp/ (compiled-PBP builder, nflfastR parity), nfl_data_ingest/ (URL-ingest of nfl-raw JSON), model_training/track6_nfl_ep_wp/ (EP/WP/CP trainer + reports), nfl_model_publish/ (artifact uploader). (Populated across SP1–SP2.)
  • R/ — dataset-parity publish toolchain (write_dataset/publish_dataset → parquet/rds/csv.gz via piggyback). (Added in SP2.)
  • docs/ — migration playbook, design spec, implementation plans, generated model reports.

Develop

cd python
uv sync
uv run pytest          # hermetic suite (integration tests deselected by default)

About

NFL compiled PBP datasets + EP/WP/CP models built from sportsdataverse/nfl-raw; published to sportsdataverse-data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors