Skip to content

[wip] OLMo3 anneals#126

Open
aetting wants to merge 193 commits intomainfrom
olmo3-anneals
Open

[wip] OLMo3 anneals#126
aetting wants to merge 193 commits intomainfrom
olmo3-anneals

Conversation

@aetting
Copy link
Contributor

@aetting aetting commented Jul 1, 2025

branch for running anneals from OLMo3 checkpoint

@aetting aetting requested a review from undfined July 1, 2025 20:10
Copy link
Collaborator

@undfined undfined left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few small things regarding paths to consider and some inconsistencies to resolve before launching. Namely, make sure to match activation_checkpointing from smoketest example and drop initial_lr from the anneal configs to match smoketest.

rank_microbatch_size: 8192
scheduler_type: linear
warmup_steps: 0
activation_checkpointing: true
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add this to your anneal configs above.

Copy link
Collaborator

@undfined undfined left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

cursor[bot]

This comment was marked as outdated.

cursor[bot]

This comment was marked as outdated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants