
feat: prevent containers from running in NRI if we fail to apply policy to them #392

Open
holyspectral wants to merge 1 commit into rancher-sandbox:main from holyspectral:prevent-container-from-running-nri

Conversation

@holyspectral
Collaborator

@holyspectral holyspectral commented Mar 11, 2026

What this PR does / why we need it:

As part of error handling, when we fail to apply protection to a container, we now fail the container creation flow by default. Users can override this behavior via the NRI_FAILOPEN environment variable.

When a container is prevented from starting, logs can be seen in these places:

In our log:

{"time":"2026-03-11T20:57:12.766551002Z","level":"ERROR","msg":"Runtime-enforcer has prevented the container from starting. To change this behavior, set environment variable NRI_FAILOPEN to true","component":"agent","component":"nri-handler","component":"nri-plugin","reason":"failed to add pod container from NRI","containerName":"ubuntu","podName":"ubuntu-deployment-595f9465f7-dnstl","error":"SOME ERROR"}

containerd:

Mar 11 20:59:31 kind-control-plane containerd[122398]: time="2026-03-11T20:59:31.776225421Z" level=error msg="NRI container start failed" error="rpc error: code = Unknown desc = failed to add pod container from NRI: SOME ERROR. Runtime-enforcer has prevented the container 'ubuntu-cronjob-29554377-5tzw4/ubuntu' from starting. To change this behavior, set environment variable NRI_FAILOPEN to true"

kubernetes:

    lastState:                                                          
      terminated:                                                       
        containerID: containerd://2fa1c89c84c448d36d878a44439cc2901bc2939b58be1bc5f3751be31ac4b23d
        exitCode: 128                                                   
        finishedAt: "2026-03-11T20:57:28Z"                              
        message: 'NRI container start failed: rpc error: code = Unknown desc = failed
          to add pod container from NRI: SOME ERROR. Runtime-enforcer has prevented
          the container ''ubuntu-deployment-595f9465f7-dnstl/ubuntu'' from starting.
          To change this behavior, set environment variable NRI_FAILOPEN to true'
        reason: StartError                                              
        startedAt: "1970-01-01T00:00:00Z"         

Which issue(s) this PR fixes

fixes #262

Special notes for your reviewer:

Checklist:

  • squashed commits into logical changes
  • includes documentation
  • adds unit tests
  • adds or updates e2e tests

@holyspectral holyspectral self-assigned this Mar 11, 2026
@holyspectral holyspectral added the enhancement New feature or request label Mar 11, 2026
@holyspectral holyspectral marked this pull request as draft March 11, 2026 20:28
@holyspectral holyspectral force-pushed the prevent-container-from-running-nri branch from e5c558c to 45cfd00 on March 11, 2026 20:49
@holyspectral holyspectral marked this pull request as ready for review March 11, 2026 21:02
As part of error handling, when we fail to apply protection to a
container, we fail the container creation flow by default.

Users can set the NRI_FAILOPEN environment variable to change this behavior.

Signed-off-by: Sam Wang (holyspectral) <sam.wang@suse.com>
@holyspectral holyspectral force-pushed the prevent-container-from-running-nri branch from 45cfd00 to 1f93cdd on March 11, 2026 22:44
Collaborator

@Andreagit97 Andreagit97 left a comment


Thank you!

"namespace", pod.GetNamespace(),
"error", err,
)
return nil, fmt.Errorf("failed to add pod container from NRI: %w", err)
Collaborator


If we return an error from StartContainer, the container won't start. What happens if we return an error here in Synchronize?

p := &plugin{
logger: logger.With("component", "nri-plugin"),
resolver: resolver,
failOpen: os.Getenv("NRI_FAILOPEN") == "true",
Collaborator


How should the end user set this environment variable? I would expect a Helm field in values.yaml, WDYT?

Collaborator Author


Yeah, this is something we can talk about. Today we can already specify the environment variable via agent.env, but we could also make it more explicit by providing a separate option. WDYT?

Collaborator


Yeah, this seems like an important feature for users, so I would probably add a new Helm field.
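Per the discussion above, the variable can already be injected through the chart's existing agent.env mechanism. A minimal sketch of what a values.yaml override might look like; the exact key structure is assumed from the comment above and not verified against the chart:

```yaml
# values.yaml (illustrative; exact structure depends on the chart's agent.env support)
agent:
  env:
    - name: NRI_FAILOPEN
      value: "true"   # fail open: containers start even if policy application fails
```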

"containerName", container.GetName(),
)

handleError := func(reason string, err error) error {
Collaborator


Can we simplify this a little bit to avoid duplication?

	handleError := func(reason string, err error) error {
		// fail open defaults
		var errNRI error
		msg := "container is starting WITHOUT enforcement due to NRI_FAILOPEN"
		if !p.failOpen {
			errNRI = fmt.Errorf("%s: %w. Runtime-enforcer has prevented the container '%s/%s' from starting. To change this behavior, set environment variable NRI_FAILOPEN to true",
				reason, err, pod.GetName(), container.GetName())
			msg = errNRI.Error()
		}
		p.logger.ErrorContext(
			ctx,
			msg,
			"containerID", container.GetId(),
			"containerName", container.GetName(),
			"podName", pod.GetName(),
			"podID", pod.GetUid(),
		)
		return errNRI
	}

Collaborator


In the end, we can log at error verbosity in both cases, since the container is not protected either way.

func (r *Resolver) applyPolicyToPodIfPresent(state *podEntry) error {
policyName := state.policyName()

// if the policy doesn't have the label we do nothing
Collaborator


Suggested change
// if the pod doesn't have the label we do nothing

state.podName(),
state.podNamespace(),
policyName,
// This can happen when the pod runs before the policy is created/reconciled when using GitOps to deploy.
Collaborator


It's really up to us what we want to do. I'm also fine with returning an error since the pod will start without protection, but the user might think this is protected...



Development

Successfully merging this pull request may close these issues.

Design: design a better management for policy failures
