Windows GPU TensorRT CI Pipeline

[CUDA] Support FP8 (E4M3) KV Cache for Group Query Attention #10437

Sign in to view logs

Triggered via pull request February 14, 2026 04:12

tianleiwu

synchronize #27321

tlwu/20260211/gqa_fp8_kv_cache

Status Success

Total duration 1h 35m 2s

Artifacts 1

windows_tensorrt.yml

on: pull_request

Windows GPU TensorRT CI Pipeline

Windows GPU TensorRT CI Pipeline Test Job

Annotations

6 warnings

Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1234

epilog offset from end of function exceeds 4095

Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1227

epilog offset from end of function exceeds 4095

Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1220

epilog offset from end of function exceeds 4095

Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1213

epilog offset from end of function exceeds 4095

Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1206

epilog offset from end of function exceeds 4095

Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1199

epilog offset from end of function exceeds 4095

Artifacts

Produced during runtime

Name	Size	Digest
build-artifacts	1.9 GB	`sha256:271e4935423e6418b9369ced8e3b5c119a7e71a97637b19fd077d28c1e50672a`