-
Notifications
You must be signed in to change notification settings - Fork 255
Enable clock_gettime through seccomp, required by 580. #324
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
After upgrading to nvidia 580.76.05-6 from nvidia 575.64.05-4, my Incus containers failed to boot up. Running Incus in debug mode, I could find this log line: lxc baldvin-test2 20250828094627.156 DEBUG utils - ../src/lxc/utils.c:run_buffer:560 - Script exec /usr/share/lxc/hooks/nvidia produced output: nvidia-container-cli: ldcache error: process /usr/bin/ldconfig terminated with signal 9 Inserting strace into /usr/share/lxc/hooks/nvidia, the strace output had 600MB+ worth of clock_gettime(CLOCK_MONOTONIC, 0x7fff12e98060) = -1 EPERM (Operation not permitted) This CL adds clock_gettime to the list of permitted calls, which fixed the issue on my machine.
|
I don't really see how the driver upgrade would cause this. From the perspective of ldconfig which is being run to update the ldcache in the container the driver libraries are opague. @baldvin-kovacs do you know what is triggering the |
Sorry, no clue. I just know that after the upgrade, my incus containers stopped working. I was stracing it, and saw this call rejected. I made this change, and all of a sudden my incus containers were working again :) So yes, it can be a change in other systems --- can be that this particular call was made earlier too, but the execution environment never rejected it, and now that became more strict. Unfortunately I just upgraded with a pacman -Syu, so not separately the nvidia driver. I did not trace back the issue to a change of the nvidia driver versus a change of the execution environment, sorry... |
|
@baldvin-kovacs would you be able to sign-off your commit as required by the DCO? Otherwise I could create a PR with this change. |
|
@baldvin-kovacs we should be able to verify that it's not specific to the driver by running |
|
@lahwaacz have you been able to narrow down the change in |
|
No, I did not debug this at all. |
|
I created #326 to be able to get this in before the upcoming v1.18.0 release. |
|
Especially given the |
After upgrading to nvidia 580.76.05-6 from nvidia 575.64.05-4, my Incus containers failed to boot up.
Running Incus in debug mode, I could find this log line:
lxc baldvin-test2 20250828094627.156 DEBUG utils - ../src/lxc/utils.c:run_buffer:560 - Script exec /usr/share/lxc/hooks/nvidia produced output: nvidia-container-cli: ldcache error: process /usr/bin/ldconfig terminated with signal 9
Inserting strace into /usr/share/lxc/hooks/nvidia, the strace output had 600MB+ worth of
clock_gettime(CLOCK_MONOTONIC, 0x7fff12e98060) = -1 EPERM (Operation not permitted)
This CL adds clock_gettime to the list of permitted calls, which fixed the issue on my machine.