Commit 6383474
authored
Add scripts/submit_eval_jobs_new.py (#1638)
* Add scripts/submit_eval_jobs_new.py to submit olmo-eval-internal jobs via Beaker. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
* cleaned up script
* Address PR review: dedup CHANGELOG, sanitize names, gate Weka mounts, use safe_dump. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
* Address PR review: rename submit_eval_jobs scripts; add --olmo_eval_ref; deprecate old script. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
* Point auto-launched evals at submit_eval_jobs_old.py to keep existing flag set working. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
* Add PR link to CHANGELOG entry. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>1 parent 5e93387 commit 6383474
7 files changed
Lines changed: 961 additions & 717 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
50 | 50 | | |
51 | 51 | | |
52 | 52 | | |
| 53 | + | |
53 | 54 | | |
54 | 55 | | |
55 | 56 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
| 6 | + | |
| 7 | + | |
6 | 8 | | |
7 | 9 | | |
8 | 10 | | |
| |||
96 | 98 | | |
97 | 99 | | |
98 | 100 | | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1237 | 1237 | | |
1238 | 1238 | | |
1239 | 1239 | | |
1240 | | - | |
| 1240 | + | |
1241 | 1241 | | |
1242 | 1242 | | |
1243 | 1243 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
38 | 38 | | |
39 | 39 | | |
40 | 40 | | |
41 | | - | |
| 41 | + | |
42 | 42 | | |
43 | 43 | | |
44 | 44 | | |
| |||
0 commit comments