Skip to content

Commit aeaef24

Browse files
committed
update example
1 parent 3daa326 commit aeaef24

1 file changed

Lines changed: 5 additions & 1 deletion

File tree

recipes/qwen3.5/fp8/disagg/mooncake/profile/1p1d-dep4-nsys-profile-slowdown.yaml

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
# Qwen3.5-397B-A17B-FP8 disaggregated 1P1D: TP4 prefill + DEP4 decode (Mooncake, staging buffer).
22
# Model / resources / backend / sglang_config: copied from
33
# recipes/qwen3.5/fp8/disagg/mooncake/profile/1p1d-dep4-nsys-profile.yaml
4-
# This file changes profiling + benchmark only, to pair nsys with SGLang decode /slow_down.
4+
# This file changes frontend + profiling + benchmark only, to pair nsys with SGLang decode /slow_down.
55
#
66
# Slow-down is meant to be used with SA-Bench warmup skipped (num_warmup_mult: 0). The
77
# separate benchmark warmup is disabled so step indices stay predictable; the role of
@@ -32,6 +32,10 @@ resources:
3232
prefill_workers: 1
3333
decode_workers: 1
3434

35+
frontend:
36+
type: "sglang"
37+
enable_multiple_frontends: false
38+
3539
backend:
3640

3741
prefill_environment:

0 commit comments

Comments
 (0)