fix(lora): use float32 intermediate buffer in fused MoE LoRA to prevent bf16 precision loss #38686
Closed
prsabahrami wants to merge 1 commit into vllm-project:main from