feat: Enable FP8 (E4M3/E5M2) in concat_mla_k for optimize long-context prefill performance and refactor type dispatch for BF16/FP16#3129
Open
qiching wants to merge 2 commits intoflashinfer-ai:mainfrom
Commits
Commits on Apr 21, 2026
- committed
Albert Cheng (Engrg-Hardware 1) - committed
Albert Cheng (Engrg-Hardware 1)