[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support #10049
Triggered via pull request
February 5, 2026 00:39
Status
Cancelled
Total duration
33m 57s
Artifacts
–
android.yml
on: pull_request
AndroidBinarySizeCheckJob_MinimalBaseline
11m 18s
android_nnapi_ep
33m 24s
Android CI Pipeline
27m 39s
Annotations
3 errors and 2 warnings
|
Android CI
Canceling since a higher priority waiting request for Android CI-refs/pull/27246/merge exists
|
|
android_nnapi_ep
Canceling since a higher priority waiting request for Android CI-refs/pull/27246/merge exists
|
|
android_nnapi_ep
The operation was canceled.
|
|
AndroidBinarySizeCheckJob_MinimalBaseline
stderr: WARNING! Your credentials are stored unencrypted in '/home/cloudtest/.docker/config.json'.
Configure a credential helper to remove this warning. See
https://docs.docker.com/go/credential-store/
|
|
android_nnapi_ep
Terrapin tool path provided but not found at 'C:/local/Terrapin/TerrapinRetrievalTool.exe'. Attempting direct download for vcpkg.
|