Skip to content

Conversation

@elezar
Copy link
Member

@elezar elezar commented Feb 28, 2025

This change copies ldconfig into a memfd before executing it from the createContainer hook.

See NVIDIA/libnvidia-container@ff83efb where this was done for libnvidia-container.

This backports #941

@elezar elezar self-assigned this Feb 28, 2025
@elezar elezar added this to the v1.17.5 milestone Feb 28, 2025
@elezar elezar requested a review from klueska February 28, 2025 12:05
This change copies ldconfig into a memfd before executing it from
the createContainer hook.

Signed-off-by: Evan Lezar <[email protected]>
@elezar elezar merged commit 267fb59 into NVIDIA:release-1.17 Feb 28, 2025
10 checks passed
@elezar elezar deleted the seal-ldconfig branch February 28, 2025 20:13
debarshiray added a commit to debarshiray/toolbox that referenced this pull request Jun 11, 2025
NVIDIA Container Toolkit 1.17.5 requires Go >= 1.22 [1], and starts
using enable-cuda-compat hooks in the Container Device Interface
specification generated by it [2].  For example:
  "hookName": "createContainer",
  "path": "/usr/bin/nvidia-cdi-hook",
  "args": [
    "nvidia-cdi-hook",
    "enable-cuda-compat",
    "--host-driver-version=570.153.02"
  ]

The new hook makes it possible to have containers with a
/usr/local/cuda/compat/libcuda.so.* that's newer than the proprietary
NVIDIA driver on the host operating system, so that applications can use
a newer CUDA without having to update the driver [3].  Even though this
sounds useful, the hook has been disabled until it's handled by the
'init-container' command and there's a clear way to test it.

The src/go.sum file was updated with 'go mod tidy'.

[1] NVIDIA Container Toolkit commit 5bdf14b1e7c24763
    NVIDIA/nvidia-container-toolkit@5bdf14b1e7c24763
    NVIDIA/nvidia-container-toolkit#941
    NVIDIA/nvidia-container-toolkit#950

[2] NVIDIA Container Toolkit commit 76040ff2ad63fb82
    NVIDIA/nvidia-container-toolkit@76040ff2ad63fb82
    NVIDIA/nvidia-container-toolkit#906
    NVIDIA/nvidia-container-toolkit#948

[3] https://docs.nvidia.com/deploy/cuda-compatibility/

containers#1662
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants