Skip to content

Commit ed6ff5c

Browse files
committed
fix: parse moe_layer_freq from string to list in Megatron config adapter
1 parent 4b9c0a3 commit ed6ff5c

File tree

1 file changed

+0
-4
lines changed

1 file changed

+0
-4
lines changed

tests/trainer/test_megatron_trainer.py

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -145,16 +145,12 @@ def test_deepseek_v2_lite(self):
145145
exp_path="examples/megatron/configs/MI300X/deepseek_v2_lite-pretrain.yaml",
146146
env_override={},
147147
extra_args=[
148-
"--num_layers",
149-
"4",
150148
"--train_iters",
151149
"3",
152150
"--micro_batch_size",
153151
"1",
154152
"--global_batch_size",
155153
"8",
156-
"--moe_layer_freq",
157-
"[0]*1+[1]*3",
158154
"--expert_model_parallel_size",
159155
"8",
160156
],

0 commit comments

Comments
 (0)