Commit 6930c7c

[fix] fix qwen policy, now use gather output as logits
1 parent bc6e14a commit 6930c7c

1 file changed: +3 -2 lines changed

applications/ColossalChat/coati/distributed/consumer.py

@@ -66,10 +66,11 @@ def setup(self) -> None:
         launch(self.rank, self.world_size, self.master_addr, self.master_port, local_rank=0)
 
         plugin_config = dict(
-            tp_size=1,
+            tp_size=2,
             pp_size=1,
             precision="bf16",
-            zero_stage=1,
+            zero_stage=2,
+            parallel_output=False,
         )
         if self.plugin_config.get("pp_size", 1) > 1 and "num_microbatches" not in self.plugin_config:
             plugin_config["microbatch_size"] = self.microbatch_size
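
The commit title says the policy now uses the gathered output as logits, and the diff sets `parallel_output=False` alongside `tp_size=2`. The following is a minimal pure-Python sketch of why that can matter, assuming (this is not confirmed by the diff itself) that with tensor parallelism each rank otherwise holds only its own slice of the vocabulary dimension. The helper names `split_vocab`, `gather_vocab`, and `log_softmax`, and the toy logits, are hypothetical and do not come from ColossalChat.

```python
import math

def split_vocab(logits, tp_size):
    """Split one row of logits into tp_size equal vocab shards (one per rank)."""
    shard = len(logits) // tp_size
    return [logits[r * shard:(r + 1) * shard] for r in range(tp_size)]

def gather_vocab(shards):
    """Concatenate the per-rank shards back into full-vocabulary logits."""
    return [x for shard in shards for x in shard]

def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

# Hypothetical logits over a 4-token vocabulary, sharded across 2 ranks.
full_logits = [2.0, 0.5, -1.0, 3.0]
shards = split_vocab(full_logits, tp_size=2)
gathered = gather_vocab(shards)

token_id = 3  # lives on rank 1, at local index 1
# Normalizing over a single shard ignores the other rank's vocabulary entries.
local_logprob = log_softmax(shards[1])[token_id - 2]
# Normalizing over the gathered logits covers the whole vocabulary.
global_logprob = log_softmax(gathered)[token_id]

print(local_logprob, global_logprob)
```

In this sketch the two log-probabilities differ, because the shard-local normalizer omits half of the vocabulary. Code that treats the model output as ordinary logits therefore needs either the gathered tensor or a loss that is aware of the sharding.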
