Skip to content

add fused fp8 moe kernel for low-latency llm inference#49

Open
VAthree wants to merge 1 commit into
Tencent:mainfrom
VAthree:add-fused-fp8-moe-kernel
Open

add fused fp8 moe kernel for low-latency llm inference#49
VAthree wants to merge 1 commit into
Tencent:mainfrom
VAthree:add-fused-fp8-moe-kernel

Commits

Commits on Jun 3, 2026