Skip to content

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support #10049

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support #10049

Triggered via pull request February 5, 2026 00:39
Status Cancelled
Total duration 33m 57s
Artifacts

android.yml

on: pull_request
AndroidBinarySizeCheckJob_MinimalBaseline
11m 18s
AndroidBinarySizeCheckJob_MinimalBaseline
android_nnapi_ep
33m 24s
android_nnapi_ep
Android CI Pipeline
27m 39s
Android CI Pipeline
Fit to window
Zoom out
Zoom in

Annotations

3 errors and 2 warnings
Android CI
Canceling since a higher priority waiting request for Android CI-refs/pull/27246/merge exists
android_nnapi_ep
Canceling since a higher priority waiting request for Android CI-refs/pull/27246/merge exists
android_nnapi_ep
The operation was canceled.
AndroidBinarySizeCheckJob_MinimalBaseline
stderr: WARNING! Your credentials are stored unencrypted in '/home/cloudtest/.docker/config.json'. Configure a credential helper to remove this warning. See https://docs.docker.com/go/credential-store/
android_nnapi_ep
Terrapin tool path provided but not found at 'C:/local/Terrapin/TerrapinRetrievalTool.exe'. Attempting direct download for vcpkg.