ns: refresh process namespace on stale nsenter (e.g. iscsid PID change) by ksoviero · Pull Request #176 · longhorn/go-common-libs

ksoviero · 2026-04-24T03:28:34Z

Made-with: Cursor

Which issue(s) this PR fixes:

What this PR does / why we need it:

nsenter in the namespace executor is configured using the mount/network namespace paths under the host’s /proc, which are derived from a single chosen PID (e.g. iscsid for iSCSI operations). That path is resolved once when the executor is created and then reused for every command. If the daemon restarts, its PID (and therefore /host/proc//ns/...) changes or disappears. The next nsenter then fails with errors like cannot open .../ns/mnt: No such file or directory—which shows up in Longhorn as iSCSI initiator refresh / frontend expand failures after iscsi(d) or similar restarts.

This PR keeps the same behavior for successful runs, but on a narrow, recognizable failure from nsenter (stale namespace path under /ns/…), it re-resolves the process namespace directory the same way as at creation, updates the cached nsDirectory, and retries the command once. If re-resolution fails, the original nsenter error is returned so operators still see the primary failure mode.

Why it’s needed

It removes the need to restart the instance manager (or other workarounds) when the only problem is an outdated cached PID for a long-lived Executor, and makes volume operations that depend on iscsiadm in the daemon’s namespaces more robust across iscsi daemon restarts on the node.

Special notes for your reviewer:

I tested this is in my homelab and it resolved the issue with expanding volumes failing after the iSCSI daemon had been restarted.

ksoviero · 2026-05-11T14:07:55Z

@derekbit @c3y1huang is there anything I can do to help get this across the finish line?

tvanderka · 2026-05-12T17:19:22Z

Would still fail if the old pid got reused by a new process. Rare corner case, but it would likely run iscsiadm from that new pids ns as a privileged process.
edit: or just run a random container with process named "iscsid" to hijack the executor?

Signed-off-by: Kevin Soviero <ksoviero@gmail.com>

ksoviero · 2026-06-15T21:42:12Z

Would still fail if the old pid got reused by a new process

That's a problem as is, so this PR doesn't introduce any regressions in that regard.

Would still fail if the old pid got reused by a new process. Rare corner case, but it would likely run iscsiadm from that new pids ns as a privileged process.
edit: or just run a random container with process named "iscsid" to hijack the executor?

I don't see a good workaround without completely re-architecting how iscsid is implemented in Longhorn. For now, this is the least bad option that at least solves the current problem where applying security updates breaks your ability to scale volumes in Longhorn.

ksoviero force-pushed the main branch from d7f6313 to b29b6cb Compare April 24, 2026 03:37

derekbit requested a review from c3y1huang April 25, 2026 07:55

derekbit assigned ksoviero May 2, 2026

derekbit force-pushed the main branch from b29b6cb to 33cca75 Compare May 5, 2026 05:07

ns: refresh process namespace on stale nsenter (e.g. iscsid PID change)

44637ce

Signed-off-by: Kevin Soviero <ksoviero@gmail.com>

derekbit force-pushed the main branch from 33cca75 to 44637ce Compare May 18, 2026 09:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ns: refresh process namespace on stale nsenter (e.g. iscsid PID change)#176

ns: refresh process namespace on stale nsenter (e.g. iscsid PID change)#176
ksoviero wants to merge 1 commit into
longhorn:mainfrom
ksoviero:main

ksoviero commented Apr 24, 2026

Uh oh!

ksoviero commented May 11, 2026

Uh oh!

tvanderka commented May 12, 2026 •

edited

Loading

Uh oh!

ksoviero commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ksoviero commented Apr 24, 2026

Which issue(s) this PR fixes:

What this PR does / why we need it:

Special notes for your reviewer:

Uh oh!

ksoviero commented May 11, 2026

Uh oh!

tvanderka commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ksoviero commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tvanderka commented May 12, 2026 •

edited

Loading