windows_x64_release_xnnpack

[CUDA] Support FP8 (E4M3) KV Cache for Group Query Attention #10416

Triggered via pull request February 12, 2026 00:50

tianleiwu opened #27321 (tlwu/20260211/gqa_fp8_kv_cache)

Status: Success

Total duration: 27m 41s

Artifacts: none

windows_x64_release_xnnpack.yml

on: pull_request

build_x64_release_xnnpack

Annotations

6 warnings

build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1234

epilog offset from end of function exceeds 4095

build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1227

epilog offset from end of function exceeds 4095

build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1220

epilog offset from end of function exceeds 4095

build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1213

epilog offset from end of function exceeds 4095

build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1206

epilog offset from end of function exceeds 4095

build_x64_release_xnnpack: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1199

epilog offset from end of function exceeds 4095