Skip to content

grpc server: better to implement GetPreferredAllocation function #574

@cyclinder

Description

@cyclinder

What would you like to be added?

If pod declares both gpu and sriov resources, as follows:

        resources:
          limits:
            nvidia.com/gpu: 1
            spidernet.io/sriov_net_device: 1

I hope that the topology of the assigned vf and gpu is at least PHB(or PXB, PIX), so that the communication between them can enable GPU Direct RDMA, greatly improving the communication speed between them.

image

NOTE: I think the Gpu-device-plugin should also implement this function, because we don't know who will allocate the GPU or VF first.

see

// TODO: (SchSeba) check if we want to use this function

What is the use case for this feature / enhancement?

enable GPU Direct RDMA.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions