
Rank max learning rate #505

Draft
Hgherzog wants to merge 5 commits into main from
cursor/rank-max-learning-rate-9827

Conversation

@Hgherzog
Collaborator

Add rank_max_lr to enable per-rank learning rates for linear probe evaluation, providing a free learning rate sweep during in-loop evaluations.



- Each rank uses a different LR from a log-spaced range (2 orders of magnitude)
- After evaluation, all_reduce MAX to get best score across ranks
- Auto-disables if world_size < 2 or > 20
- New flag: DownstreamTaskConfig.rank_max_lr (default: False)

Co-authored-by: henryh <henryh@allenai.org>
@cursor

cursor bot commented Feb 25, 2026

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.

cursoragent and others added 2 commits February 25, 2026 20:01
- LRs: [1e-4, 5e-4, 1e-3, 5e-3, 1e-2, 5e-2, 1e-1, 5e-1]
- Auto-disable if world_size != 8 (must match number of sweep LRs)

Co-authored-by: henryh <henryh@allenai.org>
RANK_MAX_LRS = [1e-4, 5e-4, 1e-3, 5e-3, 1e-2, 5e-2, 1e-1, 5e-1]


def get_rank_lr(rank: int, world_size: int) -> float | None:
    # Auto-disable (return None) unless world_size matches the sweep size,
    # per the commit message above.
    if world_size != len(RANK_MAX_LRS):
        return None
    return RANK_MAX_LRS[rank]
Collaborator

@gabrieltseng Mar 5, 2026
world_size is not necessary for this function

select_final_test_miou_based_on_epoch_of_max_val_miou: bool = False,
n_bootstrap: int = 0,
bootstrap_seed: int = 42,
rank_max_lr: bool = False,
Collaborator

i feel like this should be true by default?
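The config fields quoted in the diff above could be sketched as a dataclass like the following. This is a simplified illustration showing only the fields visible in the diff; the real `DownstreamTaskConfig` presumably has many more fields and may not be a plain dataclass.

```python
from dataclasses import dataclass


@dataclass
class DownstreamTaskConfig:
    # Only the fields quoted in the diff; illustrative, not the full class.
    select_final_test_miou_based_on_epoch_of_max_val_miou: bool = False
    n_bootstrap: int = 0
    bootstrap_seed: int = 42
    rank_max_lr: bool = False  # the review above asks whether True is a better default
```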

