Skip to content

Conversation

@empovit
Copy link
Contributor

@empovit empovit commented Sep 15, 2025

Red Hat OpenShift blocks writing into /etc, causing the following error in compute-domain-daemon pods:

IMEXDaemonUpdateLoop failed, initiate shutdown:
writeNodesConfig failed: failed to create nodes config file:
open /etc/nvidia-imex/nodes_config.cfg: permission denied

Binding anyuid SCC to the service account when run on OpenShift solves this problem.

@copy-pr-bot
Copy link

copy-pr-bot bot commented Sep 15, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@empovit empovit marked this pull request as draft September 15, 2025 16:43
@empovit empovit marked this pull request as ready for review September 15, 2025 19:34
Red Hat OpenShift blocks writing into `/etc`, causing the
following error in compute-domain-daemon pods:

```
IMEXDaemonUpdateLoop failed, initiate shutdown:
writeNodesConfig failed: failed to create nodes config file:
open /etc/nvidia-imex/nodes_config.cfg: permission denied
```

Binding `anyuid` SCC to the service account when run on OpenShift
solves this problem.

Signed-off-by: Vitaliy Emporopulo <[email protected]>
@empovit empovit force-pushed the add-anyuid-scc-openshift-25.3 branch from 43e5139 to 0ed747c Compare September 15, 2025 20:32
@empovit empovit marked this pull request as draft September 15, 2025 20:43
@empovit empovit marked this pull request as ready for review September 15, 2025 20:51
@klueska klueska added robustness issue/pr: edge cases & fault tolerance security labels Sep 16, 2025
@klueska klueska added this to the v25.8.0 milestone Sep 16, 2025
@klueska klueska changed the title Add anyuid SCC to compute domain service account on OpenShift [backport] Add anyuid SCC to compute domain service account on OpenShift Sep 16, 2025
@klueska klueska modified the milestones: v25.8.0, v25.3.2, v25.3.3 Sep 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

robustness issue/pr: edge cases & fault tolerance security

Projects

Status: Backlog

Development

Successfully merging this pull request may close these issues.

3 participants