Skip to content

Do not rerun everything every time#228

Merged
swo merged 2 commits into
mainfrom
swo_make
Dec 30, 2025
Merged

Do not rerun everything every time#228
swo merged 2 commits into
mainfrom
swo_make

Conversation

@swo

@swo swo commented Dec 30, 2025

Copy link
Copy Markdown
Collaborator

The existing Makefile is oriented around directories, which means that make can't see when outputs have been created. This meant that, every time make was called, every single step would be rerun, regardless of what files had actually been updated.

Instead, individuals files as make targets. The diagnostics step produces an unknown number of outputs, so make a dummy status file as its output.

Also reorganize to output/RUN_ID/. Organizing by output/data/some.parquet, etc. was a speculative benefit if we wanted to have Hive-style access across multiple runs.

@swo swo marked this pull request as ready for review December 30, 2025 16:21
@swo swo merged commit 9c4b8ee into main Dec 30, 2025
4 checks passed
@swo swo deleted the swo_make branch December 30, 2025 16:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants