File tree Expand file tree Collapse file tree 2 files changed +2
-9
lines changed
Expand file tree Collapse file tree 2 files changed +2
-9
lines changed Original file line number Diff line number Diff line change @@ -222,7 +222,7 @@ qwen3.5-fp8-mi300x-sglang:
222222 - { tp: 8, conc-start: 4, conc-end: 64 }
223223
224224glm5-fp8-mi355x-sglang :
225- image : lmsysorg/sglang :v0.5.10 -rocm720-mi35x
225+ image : rocm/sgl-dev :v0.5.8.post1 -rocm720-mi35x-20260219
226226 model : zai-org/GLM-5-FP8
227227 model-prefix : glm5
228228 runner : mi355x
Original file line number Diff line number Diff line change 12851285 - " Runner script updated to clone NVIDIA/srt-slurm and map vLLM container image"
12861286 pr-link : https://github.com/SemiAnalysisAI/InferenceX/pull/1008
12871287
1288- - config-keys :
1289- - glm5-fp8-mi355x-sglang
1290- description :
1291- - " Upgrade SGLang image to v0.5.10"
1292- - " Resolve the issue: https://github.com/sgl-project/sglang/issues/19028"
1293- pr-link : https://github.com/SemiAnalysisAI/InferenceX/pull/1014
1294-
12951288- config-keys :
12961289 - minimaxm2.5-fp8-b200-vllm
12971290 description :
12981291 - " Update MiniMax-M2.5 FP8 B200 config with new search spaces"
12991292 pr-link : https://github.com/SemiAnalysisAI/InferenceX/pull/1010
1300-
1293+
13011294- config-keys :
13021295 - minimaxm2.5-fp4-b200-vllm
13031296 description :
You can’t perform that action at this time.
0 commit comments