Update configs by sgreenbury · Pull Request #352 · alan-turing-institute/autocast

sgreenbury · 2026-04-26T05:36:21Z

This pull request adds and updates several configuration files and documentation to support a new, centralized batch of planned ablation studies on the CNS (Conditioned Navier-Stokes) task. The main focus is on adding ablation configs for model architecture, loss variants, noise channels, and diffusion, as well as updating documentation to reflect the new planned batch and its implementation status. A new evaluation script for ensemble size ablation is also included.

The most important changes are:

New experiment configurations:

Added CNS ablation configs for U-Net architecture (crps_unet_azula_80m.yaml), CRPS loss variants (FairCRPS and plain CRPS), noise channels (crps_vit_noise256.yaml), and diffusion ViT baseline (diffusion_vit_large.yaml). These configs standardize architecture, loss, and training parameters for fair comparison.
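For context on the two loss variants, here is a minimal sketch of the standard ensemble CRPS estimator and its fair counterpart for a single scalar target. It is not taken from the autocast codebase: the function name `ensemble_crps` and the example values are hypothetical, and the repository's loss classes may differ in reduction and tensor handling.

```python
from itertools import product


def ensemble_crps(members, target, fair=False):
    """Ensemble CRPS estimate for one scalar target (illustrative only).

    plain: mean|x_i - y| - sum_ij |x_i - x_j| / (2 m^2)
    fair:  mean|x_i - y| - sum_ij |x_i - x_j| / (2 m (m - 1))
    """
    m = len(members)
    if fair and m < 2:
        raise ValueError("fair CRPS needs at least two ensemble members")
    # Accuracy term: mean absolute error of the members against the target.
    accuracy = sum(abs(x - target) for x in members) / m
    # Spread term: sum of absolute differences over all ordered member pairs.
    spread = sum(abs(a - b) for a, b in product(members, repeat=2))
    denom = 2 * m * (m - 1) if fair else 2 * m * m
    return accuracy - spread / denom


members = [0.0, 1.0, 2.0, 3.0]  # hypothetical 4-member ensemble
plain = ensemble_crps(members, target=1.5)
fair = ensemble_crps(members, target=1.5, fair=True)
print(plain, fair)
```

The two estimators differ only in the normalization of the spread term, so the gap between them is largest at small ensemble sizes.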

Documentation updates for planned CNS batch:

Updated slurm_scripts/ablations/README.md to describe the new planned CNS batch, list its runs, and mark the implementation status of each ablation. Added a section explaining the batch submission scripts and how each config/run fits into the overall study.
Updated architecture-specific READMEs to reflect the new U-Net config, clarify the status of FNO, and document parameter-matching decisions.

Loss variant and ensemble size documentation:

Marked the CRPS loss variant configs as implemented and clarified class paths for loss/metric targets. Updated the ensemble size ablation docs to include the planned n_members=4 ViT run with batch size details.

Ablation status and evaluation scripts:

Updated the status and evaluation details for cached-latent CRPS and global conditioning ablations, including instructions for reruns and eval script locations.
Added a new eval script to select the best multi-Winkler checkpoint after 0.25 progress for ensemble size ablation runs.
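The checkpoint selection described above can be sketched as follows. This is an illustration under stated assumptions, not the script from the PR: the `CheckpointRecord` type, the field names, and the example records are hypothetical, "after 0.25 progress" is read as strictly greater than 0.25, and a lower multi-Winkler score is treated as better.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CheckpointRecord:
    path: str             # hypothetical checkpoint file name
    progress: float       # fraction of training completed, in [0, 1]
    multi_winkler: float  # validation score; lower is assumed better


def select_best_checkpoint(records, min_progress=0.25):
    """Return the lowest-scoring checkpoint saved after min_progress."""
    eligible = [r for r in records if r.progress > min_progress]
    if not eligible:
        raise ValueError(f"no checkpoint found after progress {min_progress}")
    return min(eligible, key=lambda r: r.multi_winkler)


records = [
    CheckpointRecord("ckpt_010.ckpt", 0.10, 0.50),  # best score, but too early
    CheckpointRecord("ckpt_050.ckpt", 0.50, 0.80),
    CheckpointRecord("ckpt_075.ckpt", 0.75, 0.70),
    CheckpointRecord("ckpt_100.ckpt", 1.00, 0.90),
]
best = select_best_checkpoint(records)
print(best.path)
```

The progress cutoff excludes early checkpoints whose score may look good before training has stabilized.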

Switch the U-Net 80M ablation config to the Azula library default ffn_factor=1, and re-budget hid_channels to [62, 124, 248, 496] to keep the canonical 1:2:4:8 doubling ratio while landing at ~81.3M params (within 0.3% of the 81.0M CRPS ViT baseline). Both backbones now use their respective Azula-canonical FFN ratios (UNet=1, ViT=4 per Vaswani 2017).
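As a quick sanity check on the numbers in this commit message, the snippet below verifies the 1:2:4:8 channel ratio and computes the relative parameter gap. It uses only the rounded figures quoted above (81.3M and 81.0M); the exact parameter counts are not given here, so the computed gap can differ slightly from the quoted 0.3%.

```python
# Channel widths quoted in the commit message.
hid_channels = [62, 124, 248, 496]

# Each stage width relative to the first stage.
base = hid_channels[0]
ratios = [c / base for c in hid_channels]

# Rounded parameter counts in millions, as quoted (exact counts are unknown here).
unet_params_m = 81.3
vit_params_m = 81.0
rel_diff_pct = 100 * abs(unet_params_m - vit_params_m) / vit_params_m

print(ratios)
print(round(rel_diff_pct, 2))
```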

sgreenbury added 11 commits April 24, 2026 09:55

Add FM progress eval scripts

101af07

Add flow matching eval submitters for 25%, 50%, and 75% training checkpoints. Include both cached-latent encode-once evals and explicit ambient variants so the progress metrics do not overwrite final-checkpoint outputs.
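One way to keep progress metrics from overwriting final-checkpoint outputs is to give every (progress, variant) pair its own output directory. The sketch below illustrates that idea only; the directory layout, the `progress_eval_dirs` helper, and the run path are hypothetical and are not taken from the submitter scripts.

```python
from pathlib import PurePosixPath


def progress_eval_dirs(run_dir, fractions=(0.25, 0.50, 0.75),
                       variants=("cached_latent", "ambient")):
    """Map each (fraction, variant) pair to a distinct eval output directory."""
    root = PurePosixPath(run_dir)
    out = {}
    for frac in fractions:
        # e.g. 0.25 -> "progress_025"
        tag = f"progress_{int(round(frac * 100)):03d}"
        for variant in variants:
            out[(frac, variant)] = root / "eval" / tag / variant
    return out


dirs = progress_eval_dirs("runs/fm_example")  # hypothetical run directory
final_dir = PurePosixPath("runs/fm_example") / "eval" / "final"

for key, path in sorted(dirs.items()):
    print(key, path)
```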

Update MAE script epoch timings

95e9a7a

Fix submission scripts

567740c

Remove ckpt log on wandb

b83b2a2

Add best multiwinkler ckpt evals

e070a89

Merge remote-tracking branch 'origin/main' into 2026-04-23/runs-with-…

3007821

…0p-evals

Add overrides for snapshots

54af299

Add ablations

bc4e76d

Update timings

9c98db0

Fix stale eval default overrides

c412803

Merge the base eval defaults before generating struct-safe overrides for resolved configs, so old runs can accept newly added eval keys. This keeps explicit eval overrides working for rollout snapshots and future eval defaults.
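The effect of this fix can be illustrated with plain dictionaries. The sketch below is an assumption about the mechanism, not the repository's code: the helper names and config keys are hypothetical, and it relies only on the Hydra convention that `key=value` overrides an existing key while `+key=value` adds a new one. Merging the base eval defaults into an old resolved config first means newly added eval keys already exist when the overrides are generated.

```python
def deep_merge(base, override):
    """Recursively merge override into base; override wins on conflicts."""
    merged = dict(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged


def has_key(cfg, dotted):
    """Check whether a dotted key such as 'eval.batch_size' exists in cfg."""
    node = cfg
    for part in dotted.split("."):
        if not isinstance(node, dict) or part not in node:
            return False
        node = node[part]
    return True


def struct_safe_overrides(cfg, overrides):
    """Use 'key=value' for existing keys and '+key=value' for missing ones."""
    return [f"{k}={v}" if has_key(cfg, k) else f"+{k}={v}"
            for k, v in overrides.items()]


# Hypothetical configs: the old run predates the 'snapshot_steps' eval key.
base_eval_defaults = {"eval": {"batch_size": 8, "snapshot_steps": None}}
old_run_cfg = {"eval": {"batch_size": 4}}
requested = {"eval.snapshot_steps": 10}

merged_cfg = deep_merge(base_eval_defaults, old_run_cfg)
print(struct_safe_overrides(old_run_cfg, requested))
print(struct_safe_overrides(merged_cfg, requested))
```

In this sketch the old run's own values are preserved (`batch_size` stays 4), and only the missing keys are filled in from the defaults.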

sgreenbury merged commit 7bc7163 into main Apr 26, 2026
3 checks passed

sgreenbury deleted the 2026-04-23/runs-with-0p-evals branch April 26, 2026 09:08
