We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 2f7e7ff commit 35be871Copy full SHA for 35be871
perf-changelog.yaml
@@ -1249,4 +1249,6 @@
1249
description:
1250
- "Optimize MiniMax-M2.5 FP8 MI355X vLLM search-space"
1251
- "Add tp2 ep2 search-space entries (conc 2-256) for all seq-len configs"
1252
+ - "Upgrade vLLM image to v0.19.0"
1253
+ - "Enable FP8 KV cache + AITER FA for minimaxm2.5-fp8-mi355x-vllm"
1254
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1003
0 commit comments