chore: update nvidia plugin to v0.18.2 #79
Workflow file for this run
name: ROCm GPU Sim Cluster CI
on:  # yamllint disable-line rule:truthy
  push:
    paths:
      - kind-gpu-sim.sh
      - pods/rocm-gpu-test-pod.yaml
      - .github/workflows/rocm-ci.yaml
  workflow_dispatch:
jobs:
  gpu-sim-test:
    runs-on: ubuntu-latest
    steps:
      - name: Checkout repo
        uses: actions/checkout@v5
      - name: Install KinD + kubectl
        run: |
          curl -Lo ./kind https://kind.sigs.k8s.io/dl/v0.27.0/kind-linux-amd64
          chmod +x ./kind
          sudo mv ./kind /usr/local/bin/kind
          curl -LO "https://dl.k8s.io/release/v1.32.0/bin/linux/amd64/kubectl"
          chmod +x kubectl && sudo mv kubectl /usr/local/bin/kubectl
      - name: Create Kind GPU cluster
        run: |
          ./kind-gpu-sim.sh create rocm
      - name: Deploy test pod
        run: |
          kubectl create -f pods/rocm-gpu-test-pod.yaml
          kubectl wait --for=condition=Ready pod/gpu-rocm-test --timeout=60s
      - name: Check pod logs
        run: |
          kubectl logs pod/gpu-rocm-test