Skip to content

[llama32_1b] 3-launch o_gemv_ffn for decode (+17% tok/s) (#1631) #1161

[llama32_1b] 3-launch o_gemv_ffn for decode (+17% tok/s) (#1631)

[llama32_1b] 3-launch o_gemv_ffn for decode (+17% tok/s) (#1631) #1161