[nightshift] fix documentation drift in tutorials (#3966)
> *Pages left behind,*
> *code rewrites its own story—*
> *docs echo the old.*
- **`add-optimizer.md`**: Fixed `TrainLmConfig` import path
(`levanter.trainer` → `levanter.main.train_lm`)
- **`first-experiment.md`**: Added required `gpu_type` arg to
`ResourceConfig.with_gpu(count=1)` → `ResourceConfig.with_gpu("H100",
count=1)`
- **`train-an-lm.md`**: Added required `gpu_type` arg to
`ResourceConfig.with_gpu(count=4)`, added missing `EvalTaskConfig`
import, fixed trailing comma
- **`train-dpo.md`**: `llama_3_1_8b` is a module-level `LlamaConfig`
variable, not a callable — removed erroneous `()` that would cause
`TypeError`
- **`train_test_overlap.md`**: Renamed `DedupeConfig` → `DeconConfig`
and `DedupMode` → `DeconMode` to match actual class names in
`marin.processing.classification.decon`
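
The `train-dpo.md` item can be illustrated with a minimal sketch. It assumes a stand-in `LlamaConfig` dataclass with a made-up `hidden_dim` field (hypothetical, not the real class); the only point is that calling a module-level config instance raises `TypeError`, while referencing it directly works.

```python
from dataclasses import dataclass


# Hypothetical stand-in for the real LlamaConfig; the field and its value
# are illustrative only and not taken from the library.
@dataclass(frozen=True)
class LlamaConfig:
    hidden_dim: int = 4096


# Module-level variable holding a config instance, mirroring how the
# commit message describes `llama_3_1_8b`.
llama_3_1_8b = LlamaConfig()


def call_as_function():
    """Reproduce the old tutorial usage: calling the instance like a factory."""
    try:
        llama_3_1_8b()  # the dataclass defines no __call__, so this fails
    except TypeError as exc:
        return str(exc)
    return None


error_message = call_as_function()

# Corrected usage: reference the variable directly, without parentheses.
model_config = llama_3_1_8b

print(error_message)
```

The same pattern applies to any tutorial snippet that treats a prebuilt config object as a constructor: dropping the `()` is the whole fix.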
  train_batch_size=BATCH_SIZE, # Sequences processed per step
  num_train_steps=NUM_TRAIN_STEPS, # Total optimization steps
  learning_rate=3e-3, # Peak learning rate
@@ -116,7 +119,7 @@ model = default_train(
  model_config=model_config, # Model architecture
  train_config=training_config, # Training hyperparameters
  tags=["${YOUR_TAG1}", "${YOUR_TAG2}"], # Tags for experiment tracking
- eval_harness_tasks= [EvalTaskConfig("mmlu", 0, task_alias="mmlu_0shot"), EvalTaskConfig("mmlu", 5, task_alias="mmlu_5shot")] # Evaluation Tasks to run on the checkpoint
+ eval_harness_tasks= [EvalTaskConfig("mmlu", 0, task_alias="mmlu_0shot"), EvalTaskConfig("mmlu", 5, task_alias="mmlu_5shot")], # Evaluation Tasks to run on the checkpoint
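
Why the comma matters can be checked with a small sketch. It assumes another keyword argument follows `eval_harness_tasks` in the call (the diff does not show what comes next); `next_argument` and the simplified argument values are hypothetical, and the call text is only parsed, never executed.

```python
import ast


def call_source(separator: str) -> str:
    """Build a simplified call; `separator` goes right after eval_harness_tasks."""
    return (
        "default_train(\n"
        '    tags=["a", "b"],\n'
        f'    eval_harness_tasks=["mmlu"]{separator}  # evaluation tasks\n'
        "    next_argument=1,\n"  # hypothetical argument following the list
        ")\n"
    )


def parses(source: str) -> bool:
    """Return True if the source is syntactically valid Python."""
    try:
        ast.parse(source)
    except SyntaxError:
        return False
    return True


without_comma_ok = parses(call_source(""))
with_comma_ok = parses(call_source(","))
print(without_comma_ok, with_comma_ok)
```

Because a newline inside parentheses does not end an expression, the version without the comma runs two arguments together and fails to parse.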