Commit 76c1acf
committed
GLM-5.1: drop --detokenizer-worker-num 4 (multi-detokenizer broken in dev-cu12)
The dev-cu12 sglang build crashes immediately on the first chat request
when --detokenizer-worker-num >1 is set:
File 'multi_tokenizer_mixin.py', line 494, in event_loop
ipcs is not None
AssertionError: Batch req recv_obj.rids=['...'] has invalid http_worker_ipcs
SIGQUIT received.
Reverts that one flag to default (1). Keeps --chunked-prefill-size 8192
and --watchdog-timeout 600 since those are plain config tweaks and the
watchdog increase still helps the detokenizer-falling-behind pattern.1 parent 528ff4c commit 76c1acf
1 file changed
Lines changed: 0 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
126 | 126 | | |
127 | 127 | | |
128 | 128 | | |
129 | | - | |
130 | 129 | | |
131 | 130 | | |
132 | 131 | | |
| |||
0 commit comments