Evaluation tutorial #412

Closed

yawenzzzz wants to merge 31 commits into main from 20251024_eval_doc
Conversation

@yawenzzzz
Collaborator

PR #408 has been merged in

# Before:
config.launch.launch(follow=False)
# After: always run with torchrun so you can optionally run distributed scripts on a single GPU
config.launch.launch(follow=False, torchrun=True)
Bug: Optional Launch Config Causes Errors

The launch field in OlmoEarthExperimentConfig is now optional, but the launch and launch_prep functions don't account for this: they access attributes or call methods on config.launch directly, which raises an AttributeError when no launch configuration is provided.

Additional Locations (1)
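A minimal sketch of the kind of guard this comment calls for. The class and field names mirror the comment's wording (OlmoEarthExperimentConfig, config.launch), but the bodies here are hypothetical stand-ins, not the project's actual definitions:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LaunchConfig:
    # Stand-in fields matching the snippet above.
    follow: bool = False
    torchrun: bool = True


@dataclass
class OlmoEarthExperimentConfig:
    # The field is now optional, so callers must handle None.
    launch: Optional[LaunchConfig] = None


def launch(config: OlmoEarthExperimentConfig) -> str:
    # Fail with a clear error instead of an AttributeError on config.launch.<attr>.
    if config.launch is None:
        raise ValueError("No launch configuration provided in this experiment config.")
    return f"launched (follow={config.launch.follow}, torchrun={config.launch.torchrun})"
```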


if is_running_in_beaker() and beaker_user is None:
    raise ValueError(
        # Before: "Failed to get Beaker username. Make sure you are authenticated with Beaker."
        "Failed to get Beaker username. Make sure you are authenticated with Beaker if you are not running on a local cluster."
    )

Bug: Beaker Username Check Fails

The check for a missing Beaker username when running in Beaker is ineffective. beaker_user is assigned ANONYMOUS_USER if get_beaker_username() is None, which makes the subsequent beaker_user is None condition always false. This prevents the intended ValueError from being raised for unauthenticated Beaker users.
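A sketch of the ordering fix this comment implies: test for a missing username before substituting the ANONYMOUS_USER sentinel. The helper below is hypothetical; ANONYMOUS_USER and the error message mirror the comment and snippet above, while the function name and signature are assumptions:

```python
ANONYMOUS_USER = "anonymous"


def resolve_beaker_user(username, running_in_beaker):
    """Return a usable username, raising if Beaker requires authentication.

    `username` is the raw result of get_beaker_username() (may be None).
    """
    # Check for None *before* falling back to the sentinel; otherwise the
    # `is None` condition can never be true and the error is never raised.
    if running_in_beaker and username is None:
        raise ValueError(
            "Failed to get Beaker username. Make sure you are authenticated "
            "with Beaker if you are not running on a local cluster."
        )
    return username if username is not None else ANONYMOUS_USER
```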


num_workers=0,
pooling_type=PoolingType.MEAN,
norm_stats_from_pretrained=True,
eval_interval=Duration.steps(2),

Bug: Frequent Evaluation Interval in Production

The eval_interval for "m-eurosat" is set to Duration.steps(2), which runs evaluation every 2 steps. This is extremely frequent and likely a debug value accidentally left in production code. Other tasks use 4000 or 20000 steps, so this one should probably be similar.
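A hypothetical correction along the lines the comment suggests; 4000 is one of the two values the other tasks reportedly use, and the right number for this task is an assumption:

```python
eval_interval=Duration.steps(4000),  # align with other tasks (4000 or 20000 steps)
```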


@gabrieltseng gabrieltseng mentioned this pull request Oct 30, 2025
@yawenzzzz yawenzzzz closed this Oct 31, 2025

3 participants