Skip to content

NCCL on Kubernetes #852

NCCL on Kubernetes

NCCL on Kubernetes #852

Manually triggered July 30, 2025 08:58
Status Failure
Total duration 12m 30s
Artifacts 2

nccl-k8s.yaml

on: workflow_dispatch
nccl-tests  /  build-mpi-operator-compatible-base
2m 25s
nccl-tests / build-mpi-operator-compatible-base
nccl-tests  /  ...  /  build-nccl-gke
1m 35s
nccl-tests / nccl-test-gke / build-nccl-gke
Matrix: nccl-tests / nccl-test
Matrix: nccl-tests / nccl-test-gke / nccl-gke
Fit to window
Zoom out
Zoom in

Annotations

4 errors
nccl-tests / nccl-test-gke / nccl-gke (all_reduce_perf_mpi)
Process completed with exit code 127.
nccl-tests / nccl-test-gke / nccl-gke (reduce_scatter_perf_mpi)
The strategy configuration was canceled because "nccl-tests.nccl-test-gke.nccl-gke.all_reduce_perf_mpi" failed
nccl-tests / nccl-test-gke / nccl-gke (all_gather_perf_mpi)
The strategy configuration was canceled because "nccl-tests.nccl-test-gke.nccl-gke.all_reduce_perf_mpi" failed
nccl-tests / nccl-test-gke / nccl-gke (broadcast_perf_mpi)
The strategy configuration was canceled because "nccl-tests.nccl-test-gke.nccl-gke.all_reduce_perf_mpi" failed

Artifacts

Produced during runtime
Name Size Digest
artifact-mpi-operator-compatible-base-build-amd64 Expired
638 Bytes
sha256:dbbd9e9bd8ddad9578e5f5deeee9a5d4d7a3552e8f0e7055917bef354b3ac852
artifact-nccl-gke-build-amd64 Expired
572 Bytes
sha256:f9af7feaba230f81135d1a6fc510f0d4a56b1c942a9b8b67f0fdee31ff07e764