You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/src/how-to/train-multistage.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,30 +1,30 @@
1
-
# Train in stages
1
+
# Run multistage training
2
2
3
-
Staged training is an alternative to end-to-end training for `EncodeProcessDecode` models.
3
+
Multistage training is an alternative to [normal training](./train.md) for `EncodeProcessDecode` models.
4
4
Instead of training the full model in one pass, each component is trained in isolation before the weights are loaded into the full model for a final finetuning step.
5
5
This can be useful when the model is large, when encoder inputs have very different characteristics, or when you want finer control over the training of individual components.
6
6
7
7
## The four stages
8
8
9
-
**Stage 1 — Train encoders**
9
+
**Stage 1 - Train encoders**
10
10
11
11
Each encoder is trained independently as a standalone autoencoder (encoder + disposable decoder). One training run per encoder.
Pretrained weights are loaded into the full `EncodeProcessDecode` model and the entire model is trained end-to-end.
30
30
@@ -36,7 +36,7 @@ Pretrained weights are loaded into the full `EncodeProcessDecode` model and the
36
36
uv run imp train --multistage
37
37
```
38
38
39
-
A checkpoint is saved at the end of each stage. To resume a partially completed run, pass `--checkpoint-dir` pointing at the checkpoint directory from the original run — any stage whose checkpoint already exists there will be skipped:
39
+
A checkpoint is saved at the end of each stage. To resume a partially completed run, pass `--checkpoint-dir` pointing at the checkpoint directory from the original run - any stage whose checkpoint already exists there will be skipped:
40
40
41
41
```bash
42
42
uv run imp train --multistage --checkpoint-dir ${BASE_DIR}/training/wandb/run-<date>-<id>/checkpoints
0 commit comments