Skip to content

feat: Enable FP8 (E4M3/E5M2) in concat_mla_k for optimize long-context prefill performance and refactor type dispatch for BF16/FP16#3129

Open
qiching wants to merge 2 commits intoflashinfer-ai:mainfrom
qiching:fix/concat-mla-k-fp8-support
Open

feat: Enable FP8 (E4M3/E5M2) in concat_mla_k for optimize long-context prefill performance and refactor type dispatch for BF16/FP16#3129
qiching wants to merge 2 commits intoflashinfer-ai:mainfrom
qiching:fix/concat-mla-k-fp8-support

Commits

Commits on Apr 21, 2026