ONNX Runtime CUDA Builds

mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection #10426

Sign in to view logs

Re-run triggered February 13, 2026 00:35

#27099

milpuz01:aarch64_convolutions

Status Success

Total duration 2h 14m 55s

Artifacts 1

windows_cuda.yml

on: pull_request

Windows GPU CUDA CI Pipeline

Windows GPU CUDA CI Pipeline Test Job

Annotations

6 warnings

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1234

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1227

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1220

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1213

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1206

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1199

epilog offset from end of function exceeds 4095

Artifacts

Produced during runtime

Name	Size	Digest
build-artifacts	1.99 GB	`sha256:5444f04de3f3a767224da4d26a55b159e64427c0f00b16111abb61475ce34ce0`