NCCL on Kubernetes #851

Manually triggered July 30, 2025 08:54
Status Cancelled
Total duration 4m 28s
Artifacts 2

nccl-k8s.yaml

on: workflow_dispatch
nccl-tests / build-mpi-operator-compatible-base  1m 46s
nccl-tests / nccl-test-gke / build-nccl-gke  1m 20s
Matrix: nccl-tests / nccl-test
Matrix: nccl-tests / nccl-test-gke / nccl-gke

Annotations

9 errors
nccl-tests / build-mpi-operator-compatible-base
buildx failed with: ERROR: failed to build: failed to solve: process "/bin/sh -c apt-get update && apt install -y openssh-server && apt-get clean && rm -rf /var/lib/apt/lists/* && mkdir /run/sshd" did not complete successfully: exit code: 1
nccl-tests / nccl-test-gke / nccl-gke (broadcast_perf_mpi)
The run was canceled by @olupton.
nccl-tests / nccl-test-gke / nccl-gke (all_reduce_perf_mpi)
The run was canceled by @olupton.
nccl-tests / nccl-test-gke / nccl-gke (all_gather_perf_mpi)
The operation was canceled.
nccl-tests / nccl-test-gke / nccl-gke (all_gather_perf_mpi)
The run was canceled by @olupton.
NCCL on Kubernetes
The run was canceled by @olupton.
NCCL on Kubernetes
The run was canceled by @olupton.
NCCL on Kubernetes
The run was canceled by @olupton.

Artifacts

Produced during runtime
Name  Size  Digest
artifact-mpi-operator-compatible-base-build-amd64 (Expired)  584 Bytes  sha256:2622a2abb92d553a5f2bf7682f065fb28251bdfb65bdb39d921d458c9b554338
artifact-nccl-gke-build-amd64 (Expired)  572 Bytes  sha256:e79b794b3f31066ec3b4f27cd6c00e6624b0d349c65e7ea2d22f982e9f46d35a