Update configs #352

Merged
sgreenbury merged 11 commits into main from 2026-04-23/runs-with-0p-evals
Apr 26, 2026

Conversation

@sgreenbury
Contributor

This pull request adds and updates several configuration files and documentation to support a new, centralized batch of planned ablation studies on the CNS (Conditioned Navier-Stokes) task. The main focus is on adding ablation configs for model architecture, loss variants, noise channels, and diffusion, as well as updating documentation to reflect the new planned batch and its implementation status. A new evaluation script for ensemble size ablation is also included.

The most important changes are:

New experiment configurations:

  • Added CNS ablation configs for U-Net architecture (crps_unet_azula_80m.yaml), CRPS loss variants (FairCRPS and plain CRPS), noise channels (crps_vit_noise256.yaml), and diffusion ViT baseline (diffusion_vit_large.yaml). These configs standardize architecture, loss, and training parameters for fair comparison. [1] [2] [3] [4] [5]

Documentation updates for planned CNS batch:

  • Updated slurm_scripts/ablations/README.md to describe the new planned CNS batch, list its runs, and mark the implementation status of each ablation. Added a section explaining the batch submission scripts and how each config/run fits into the overall study. [1] [2]
  • Updated architecture-specific READMEs to reflect the new U-Net config, clarify the status of FNO, and document parameter-matching decisions. [1] [2]

Loss variant and ensemble size documentation:

  • Marked the CRPS loss variant configs as implemented and clarified class paths for loss/metric targets. Updated the ensemble size ablation docs to include the planned n_members=4 ViT run with batch size details. [1] [2] [3] [4]

Ablation status and evaluation scripts:

  • Updated the status and evaluation details for cached-latent CRPS and global conditioning ablations, including instructions for reruns and eval script locations. [1] [2] [3]
  • Added a new eval script to select the best multi-Winkler checkpoint after 0.25 progress for ensemble size ablation runs.
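The selection rule in the last bullet can be sketched as follows. This is a minimal illustration, not the script added in this PR: the `Checkpoint` record, its field names, and `select_best_checkpoint` are hypothetical, and it assumes that progress is a fraction of total training, that the 0.25 threshold is inclusive, and that a lower multi-Winkler score is better.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Checkpoint:
    """Hypothetical record for one saved checkpoint and its validation score."""
    path: str
    progress: float       # fraction of training completed, in [0, 1]
    multi_winkler: float  # assumed: lower is better


def select_best_checkpoint(
    checkpoints: Iterable[Checkpoint], min_progress: float = 0.25
) -> Checkpoint:
    """Return the lowest-scoring checkpoint at or beyond min_progress."""
    eligible = [c for c in checkpoints if c.progress >= min_progress]
    if not eligible:
        raise ValueError(f"no checkpoint with progress >= {min_progress}")
    return min(eligible, key=lambda c: c.multi_winkler)


if __name__ == "__main__":
    candidates = [
        Checkpoint("step_010.ckpt", progress=0.10, multi_winkler=0.50),
        Checkpoint("step_025.ckpt", progress=0.25, multi_winkler=0.90),
        Checkpoint("step_060.ckpt", progress=0.60, multi_winkler=0.70),
    ]
    # The 0.10 checkpoint has the lowest score but is excluded by the threshold.
    print(select_best_checkpoint(candidates).path)
```

Filtering before taking the minimum keeps very early checkpoints, whose scores may be noisy, out of the comparison.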

Add flow matching eval submitters for 25%, 50%, and 75% training checkpoints. Include both cached-latent encode-once evals and explicit ambient variants so the progress metrics do not overwrite final-checkpoint outputs.

Switch the U-Net 80M ablation config to the Azula library default
ffn_factor=1, and re-budget hid_channels to [62, 124, 248, 496] to
keep the canonical 1:2:4:8 doubling ratio while landing at ~81.3M
params (within 0.3% of the 81.0M CRPS ViT baseline). Both backbones
now use their respective Azula-canonical FFN ratios (UNet=1, ViT=4
per Vaswani 2017).

Merge the base eval defaults before generating struct-safe overrides for
resolved configs, so old runs can accept newly added eval keys.

This keeps explicit eval overrides working for rollout snapshots and future
eval defaults.
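
The merge-then-override ordering described above can be illustrated with plain dictionaries. This is a sketch under stated assumptions, not the code in this PR: `deep_merge`, `has_key`, `struct_safe_overrides`, and the `eval.*` key names are hypothetical, and "struct-safe" is interpreted here as using Hydra's `+key=value` syntax only for keys that are absent from the config.

```python
from copy import deepcopy


def deep_merge(base: dict, override: dict) -> dict:
    """Return a new dict: base updated recursively, with override winning."""
    merged = deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def has_key(cfg: dict, dotted: str) -> bool:
    """Check whether a dotted path such as 'eval.batch_size' exists in cfg."""
    node = cfg
    for part in dotted.split("."):
        if not isinstance(node, dict) or part not in node:
            return False
        node = node[part]
    return True


def struct_safe_overrides(cfg: dict, overrides: dict) -> list:
    """Emit 'key=value' for existing keys and '+key=value' for missing ones."""
    return [
        f"{key}={value}" if has_key(cfg, key) else f"+{key}={value}"
        for key, value in overrides.items()
    ]


if __name__ == "__main__":
    # Hypothetical current defaults, including a key added after the old run.
    base_eval_defaults = {"eval": {"batch_size": 8, "rollout_snapshots": []}}
    # Hypothetical resolved config saved by an older run, lacking the new key.
    old_run_cfg = {"eval": {"batch_size": 4}}

    # Merging defaults first gives the old config every current eval key,
    # while the old run's own values still take precedence.
    merged = deep_merge(base_eval_defaults, old_run_cfg)
    print(struct_safe_overrides(merged, {"eval.rollout_snapshots": "[0,10]"}))
```

Generating overrides against the merged config means a newly added eval key is treated as an ordinary existing key, so the same override string works for old and new runs.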
@sgreenbury merged commit 7bc7163 into main on Apr 26, 2026
3 checks passed
@sgreenbury deleted the 2026-04-23/runs-with-0p-evals branch on April 26, 2026 09:08