Skip to content

Commit 9da978d

Browse files
committed
fix ep
1 parent 54ade48 commit 9da978d

File tree

1 file changed

+1
-3
lines changed

1 file changed

+1
-3
lines changed

benchmarks/single_node/kimik2.5_fp4_b200.sh

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,6 @@ source "$(dirname "$0")/../benchmark_lib.sh"
55
check_env_vars \
66
MODEL \
77
TP \
8-
EP_SIZE \
98
CONC \
109
ISL \
1110
OSL \
@@ -29,8 +28,7 @@ PORT=${PORT:-8888}
2928

3029
set -x
3130
vllm serve $MODEL --host 0.0.0.0 --port $PORT \
32-
--tensor-parallel-size $TP \
33-
--expert-parallel-size $EP_SIZE \
31+
--tensor-parallel-size=$TP \
3432
--gpu-memory-utilization 0.90 \
3533
--max-model-len $MAX_MODEL_LEN \
3634
--max-num-seqs $CONC \

0 commit comments

Comments
 (0)