Do not rerun everything every time by swo · Pull Request #228 · CDCgov/cfa-vaccination-coverage-forecasting

swo · 2025-12-30T16:20:37Z

The existing Makefile is oriented around directories, which means that make can't see when outputs have been created. This meant that, every time make was called, every single step would be rerun, regardless of what files had actually been updated.

Instead, individuals files as make targets. The diagnostics step produces an unknown number of outputs, so make a dummy status file as its output.

Also reorganize to output/RUN_ID/. Organizing by output/data/some.parquet, etc. was a speculative benefit if we wanted to have Hive-style access across multiple runs.

swo added 2 commits December 30, 2025 11:17

Do not rerun everything every time

e511850

Increment version

e1d05c2

swo marked this pull request as ready for review December 30, 2025 16:21

swo merged commit 9c4b8ee into main Dec 30, 2025
4 checks passed

swo deleted the swo_make branch December 30, 2025 16:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Do not rerun everything every time#228

Do not rerun everything every time#228
swo merged 2 commits into
mainfrom
swo_make

swo commented Dec 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

swo commented Dec 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants