add fused fp8 moe kernel for low-latency llm inference #49

Open

VAthree wants to merge 1 commit into Tencent:main from VAthree:add-fused-fp8-moe-kernel

Commits on Jun 3, 2026

add fused fp8 moe kernel for low-latency llm inference
VAthree committed