Update ablation scripts for ensemble and model size evaluations by sgreenbury · Pull Request #340 · alan-turing-institute/autocast

sgreenbury · 2026-04-22T11:56:48Z

This pull request adds a comprehensive set of configuration files and documentation to support ablation studies and sensitivity sweeps for model size, ensemble size, loss function variants, architectural comparisons, and conditioning strategies in the CNS (Conditioned Navier-Stokes) and related datasets. The changes enable flexible experimentation and evaluation workflows by introducing new YAML configs for model variants, detailed README guides for each ablation, and an eval script for ensemble size studies.

Key changes include:

Model Size Ablation Configs

Added four new YAML configuration files under local_hydra/local_experiment/ablations/model_size/conditioned_navier_stokes/ to support model size sweeps for both CRPS-ViT and FM-ViT models, providing 0.4x and 2x parameter count variants. [1] [2] [3] [4]

Documentation for Ablation Studies

Introduced a top-level README.md in slurm_scripts/ablations/ outlining the scope, current status, design notes, and workflow for all ablation, comparison, and sweep experiments.
Added detailed README files for specific ablation studies, including:
- Ensemble size (ensemble_size/README.md) with batch regime details and scheduling instructions.
- CRPS loss variants (crps_variants/README.md) with implementation sketches for swapping loss functions.
- Architecture comparison between U-Net, FNO, and ViT (arch_unet_fno_vit/README.md).
- Cached-latent CRPS loss study (cached_latent_crps/README.md).
- Conditioning strategies: global vs permute (cond_global_vs_permute/README.md).

Evaluation Script for Ensemble Size Ablation

Added submit_eval_crps_ambient.sh under slurm_scripts/ablations/ensemble_size/eval/ to automate evaluation of ensemble-size ablation runs, with consistent evaluation parameters and clear documentation.

These changes provide a structured foundation for running, extending, and documenting a wide range of ablation and comparison studies in the project.

Label matches the honest measured ~2.09x / ~2.10x scaling rather than the imprecise 160M target. Updates preset filenames, variant IDs, wandb names, and README accordingly.

Extends the scan to 3 points per architecture (0p4x, baseline, 2x) using aspect-preserving, heads-fixed scaling. Keeps the smaller point at more-standard transformer dimensions to avoid confounding from overly narrow / shallow settings.

… into 2026-04-19/ablation-scripts

Pin comparison eval submitters to n_members=10 explicitly. This keeps reruns and future submissions aligned even if the global eval default changes later.

sgreenbury added 19 commits April 20, 2026 16:57

Add initial ablation stubs

2210cf9

Add initial ensemble ablation scripts

6d0b371

Update ablations for ensemble size

b83fda0

Add CNS m16 cosine epochs from timing runs

49d6dfc

Update ablation output path naming

0db40e1

Add ensemble size ablation for GPE, AD, GS

2700ad9

Remove CNS dataset in ensemble ablation scripts as run

5dc7783

Update ablation

b93e0c9

Add large model ablation

03594c2

Rename model-size ablation variants from _160m to _2x

3371418

Label matches the honest measured ~2.09x / ~2.10x scaling rather than the imprecise 160M target. Updates preset filenames, variant IDs, wandb names, and README accordingly.

Update max_epoch placeholders

ac1bb06

Add timings and update to 2x only for now

3a69487

Add eval for ablations

e10efdb

Make script executable

a5072fb

Merge remote-tracking branch 'origin/2026-04-21/ablation-large-model'…

e0848cf

… into 2026-04-19/ablation-scripts

Add model size eval

0b4fe47

Update eval n_members overrides

22ea103

Pin comparison eval submitters to n_members=10 explicitly. This keeps reruns and future submissions aligned even if the global eval default changes later.

Update eval ablation script

017144d

sgreenbury merged commit 4fe6d1a into main Apr 22, 2026
3 checks passed

sgreenbury deleted the 2026-04-19/ablation-scripts branch April 22, 2026 11:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update ablation scripts for ensemble and model size evaluations#340

Update ablation scripts for ensemble and model size evaluations#340
sgreenbury merged 19 commits intomainfrom
2026-04-19/ablation-scripts

sgreenbury commented Apr 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sgreenbury commented Apr 22, 2026

Model Size Ablation Configs

Documentation for Ablation Studies

Evaluation Script for Ensemble Size Ablation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant