Skip to content

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support #10482

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support #10482

Triggered via pull request February 9, 2026 18:36
Status Success
Total duration 40m 34s
Artifacts

ios.yml

on: pull_request
Fit to window
Zoom out
Zoom in

Annotations

1 warning
iOS_CI_on_Mac
Terrapin tool path provided but not found at 'C:/local/Terrapin/TerrapinRetrievalTool.exe'. Attempting direct download for vcpkg.