Open
Description
- allow sft / dpo to evaluate intermediate checkpoints (though prob less important)
- oe-eval: push to datalake automatically (Push evaluation results into the datalake #514)
- always push the models to beaker dataset, but allow people to disable it (useful for 405B: this way people are not always uploading hundreds of shards for a day) (Add `--try_auto_save_to_beaker` arg #505)
- refactor the submit jobs stuff. Everything should happen in mason. See https://allenai.slack.com/archives/C0836QPULRK/p1736353091821699
- better caching stuff (Add documentation on caching models. #504)
- merge tokenization logic, and allow for pre-tokenization caching. #510
- in sft / dpo, always record the global batch size, as is done in ppo
- Bump dependencies uv2 #492
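
For the global batch size item, here is a minimal sketch of how the value could be computed before logging it. It assumes the usual data-parallel definition (per-device batch size times gradient accumulation steps times number of processes). The function and parameter names are hypothetical and are not taken from the sft / dpo / ppo scripts.

```python
def global_batch_size(
    per_device_batch_size: int,
    gradient_accumulation_steps: int,
    world_size: int,
) -> int:
    """Examples consumed per optimizer step across all processes.

    All three parameter names are illustrative assumptions, not the
    actual argument names used by the training scripts.
    """
    values = {
        "per_device_batch_size": per_device_batch_size,
        "gradient_accumulation_steps": gradient_accumulation_steps,
        "world_size": world_size,
    }
    for name, value in values.items():
        if value < 1:
            raise ValueError(f"{name} must be >= 1, got {value}")
    return per_device_batch_size * gradient_accumulation_steps * world_size
```

For example, `global_batch_size(2, 8, 16)` gives 256. Recording this one number alongside the run config would make runs with different device counts easier to compare.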