You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
cluster-api: node template in scale-from-0-nodes scenario with DRA
Modify TemplateNodeInfo() to return the template of ResourceSlice.
This is to address the DRA expansion of Cluster Autoscaler, allowing users to set the number of GPUs and DRA driver name by specifying
the annotation to NodeGroup provided by cluster-api.
Signed-off-by: Tsubasa Watanabe <[email protected]>
*Note* the `maxPods` annotation will default to `110` if it is not supplied.
232
236
This value is inspired by the Kubernetes best practices
233
237
[Considerations for large clusters](https://kubernetes.io/docs/setup/best-practices/cluster-large/).
234
238
239
+
*Note* User should select the annotation for GPU either `gpu-type` or `dra-driver` depends on whether using Device Plugin or Dynamic Resource Allocation(DRA). `gpu-count` is a common parameter in both.
240
+
235
241
#### RBAC changes for scaling from zero
236
242
237
243
If you are using the opt-in support for scaling from zero as defined by the
name: "When the NodeGroup can scale from zero and DRA is enabled, it creates ResourceSlice derived from the annotation of DRA driver name and GPU count",
t.Errorf("Expected the number of DRA devices in ResourceSlice to have: %d, but got: %d", config.expectedResourceSlice.gpuCount, len(resourceslice.Spec.Devices))
0 commit comments