Skip to content

Conversation

@vsoch
Copy link
Member

@vsoch vsoch commented Jul 4, 2025

We want to test if DRA (and dranet) can better utilize the infiniband devices. We have them working with UCX, and actually I am not sure if this will add anything, but it was worth trying for minimally the learning. A few notes:

  • The main issue that the device isn't listed as a slice is because the dranet code is looking for netdevices. I need to read the dranet codebase tonight and understand the execution pathway and where I can tweak this.
  • I think we can simplify the dranet container we use, which is currently built as usernetes_dra if the device is discovered. I needed to build with podman and add the certs, and use the same usernetes base to debug why ibv_devinfo was not working. Creation of the mlx4 and mlx5 drivers in /etc/libibutils.dev seemed to show the devices, although that wasn't the issue.

I am trying to keep dra modular (in a separate container) although another option is to put it in the usernetes node. I like this approach less.

Next steps - I'm going to do some reading of dranet and then test builds that might discover the device. We want it to show up with:

kubectl  get resourceslices.resource.k8s.io -o yaml

And when that happens I can test it out for the first time with a pod. I don't actually know if it will add anything, but I was curious.

@vsoch vsoch force-pushed the dranet branch 3 times, most recently from d818c9a to 52718c1 Compare July 5, 2025 21:45
We want to test if DRA (and dranet) can better
utilize the infiniband devices. We have them
working with UCX, and actually I am not sure
if this will add anything, but it was worth trying
for minimally the learning.

Signed-off-by: vsoch <vsoch@users.noreply.github.com>
Signed-off-by: vsoch <vsoch@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants