Skip to content

Support group query attention in Attention(23) CUDA (#27082) #10395

Support group query attention in Attention(23) CUDA (#27082)

Support group query attention in Attention(23) CUDA (#27082) #10395

Triggered via push February 11, 2026 17:58
Status Success
Total duration 26m 21s
Artifacts
build_x64_release_xnnpack
25m 21s
build_x64_release_xnnpack
Fit to window
Zoom out
Zoom in

Annotations

6 warnings
build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1234
epilog offset from end of function exceeds 4095
build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1227
epilog offset from end of function exceeds 4095
build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1220
epilog offset from end of function exceeds 4095
build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1213
epilog offset from end of function exceeds 4095
build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1206
epilog offset from end of function exceeds 4095
build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1199
epilog offset from end of function exceeds 4095