You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix(gpu): prefer single CDI devices for local runtimes
Prefer a single CDI-qualified device when Docker or Podman resolves the default GPU request to one GPU.
Allow nvidia.com/gpu=all only as a WSL2 all-only compatibility fallback, using Docker daemon info and Podman's /dev/dxg probe to identify that case.
Update driver docs, architecture notes, and GPU e2e coverage for the default selection behavior.
Signed-off-by: Evan Lezar <elezar@nvidia.com>
"default GPU request resolved only to nvidia.com/gpu=all, which is not allowed on this platform; set driver_config.cdi_devices to [\"nvidia.com/gpu=all\"] explicitly to request all GPUs",
154
+
),
155
+
}
156
+
}
157
+
}
158
+
159
+
impl std::error::ErrorforCdiGpuSelectionError{}
160
+
161
+
/// Resolve a local runtime GPU request into CDI device identifiers.
9
162
///
10
-
/// `None` means no GPU was requested. A GPU request with no explicit CDI
11
-
/// devices uses the CDI all-GPU request; otherwise the driver-configured CDI
12
-
/// devices pass through unchanged.
163
+
/// `None` means no GPU was requested. Explicit driver-configured CDI devices
164
+
/// pass through unchanged. A default GPU request uses the driver-selected
Copy file name to clipboardExpand all lines: crates/openshell-driver-docker/README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -32,7 +32,7 @@ contract:
32
32
|`apparmor=unconfined`| Avoids Docker's default profile blocking required mount operations. |
33
33
|`restart_policy = unless-stopped`| Keeps managed sandboxes resumable across daemon or gateway restarts. |
34
34
|`PidsLimit`| Enforces the sandbox PID budget at the Docker cgroup layer. Set `[openshell.drivers.docker].sandbox_pids_limit = 0` to inherit the Docker/runtime default. |
35
-
| CDI GPU request | Uses `driver_config.cdi_devices` when set; otherwise requests all NVIDIA GPUs when the sandbox spec asks for GPU support and daemon CDI support is detected. |
35
+
| CDI GPU request | Uses `driver_config.cdi_devices` when set; otherwise selects one concrete NVIDIA CDI GPU when the sandbox spec asks for GPU support and daemon CDI support is detected. Docker daemon `/info` can permit `nvidia.com/gpu=all` as a WSL2 all-only compatibility fallback. |
36
36
37
37
The agent child process does not retain these supervisor privileges.
0 commit comments