You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/Pretraining.md
+15-4Lines changed: 15 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -39,7 +39,7 @@ This section covers:
39
39
40
40
### Prerequisites
41
41
42
-
- Python 3.12+
42
+
- Python 3.11+
43
43
- CUDA-capable GPU (recommended: 40GB+ VRAM)
44
44
- Linux/macOS environment
45
45
@@ -63,6 +63,14 @@ This section covers:
63
63
pre-commit install
64
64
```
65
65
66
+
### Running on Docker
67
+
68
+
We run our training scripts using the `olmo-core-tch271cu128-2025-09-15` Docker image published by [ai2-olmo-core](https://github.com/allenai/OLMo-core/blob/main/README.md).
69
+
70
+
**Important Notes:**
71
+
- The code from this repository is **not included** in the Docker image to aid in active development. The code is mounted or copied at runtime.
72
+
- This Docker image may not work on your own cluster if you have different hardware or driver/CUDA versions. The image is built for CUDA 12.8 with PyTorch 2.7.1.
73
+
-**For adaptation:** See our [Dockerfile](../Dockerfile) to understand how to build an image compatible with your hardware and CUDA setup
66
74
67
75
## Launching Scripts
68
76
@@ -200,14 +208,16 @@ Evaluation datasets have default paths set in [`olmoearth_pretrain/evals/dataset
200
208
201
209
1. Download/prepare the evaluation datasets locally
202
210
2. Set environment variables (see [Environment Variables](#environment-variables))
203
-
3. Or disable evaluations you don't have by adding the following override to your command:
211
+
3. If not using all evaluations, enable only the ones you have set up by adding an override:
212
+
213
+
e.g to only run mados and pastis_sentinel2 evals add the following overide.
The task names correspond to the user-chosen names specified in the training configuration
208
218
209
219
---
210
-
### Main Training Scripts
220
+
### Official Training Scripts
211
221
> **🏢 AI2 Researchers - Choose Your Launch Method:**
212
222
>
213
223
> **For Beaker Batch Jobs (Pre-emptible):**
@@ -230,6 +240,7 @@ Evaluation datasets have default paths set in [`olmoearth_pretrain/evals/dataset
230
240
>
231
241
> See [Setup-Internal.md](Setup-Internal.md#launch-methods) for more details.
232
242
243
+
All Official release scripts can be found at [`scripts/official/`](../scripts/official/).
233
244
Below is a table demonstrating how to launch various model sizes using `torchrun` (for external users and AI2 sessions). Adjust the dataset path and configuration overrides as needed for your setup.
234
245
235
246
| Model Size | Script | Hardware | Example Command | Notes |
0 commit comments