Skip to content

Add NIXL kv cache transfer example#203

Merged
k8s-ci-robot merged 2 commits into
kubernetes-sigs:mainfrom
anson627:add-inference-example
May 20, 2026
Merged

Add NIXL kv cache transfer example#203
k8s-ci-robot merged 2 commits into
kubernetes-sigs:mainfrom
anson627:add-inference-example

Conversation

@anson627

Copy link
Copy Markdown
Contributor

What type of PR is this?

/kind documentation

What this PR does / why we need it:

This pull request adds a complete, cloud-agnostic example for benchmarking NUMA-aware GPU-to-GPU VRAM transfers over RDMA in Kubernetes using NIXL and DRA. The example demonstrates how different NUMA placements of GPUs and RDMA NICs affect transfer bandwidth and latency, and provides manifests, scripts, and documentation for reproducible experiments.

Which issue(s) this PR is related to:

N/A

Special notes for your reviewer:

Does this PR introduce a user-facing change?

NONE

@k8s-ci-robot k8s-ci-robot added the kind/documentation Categorizes issue or PR as related to documentation. label May 18, 2026
@netlify

netlify Bot commented May 18, 2026

Copy link
Copy Markdown

Deploy Preview for dranet canceled.

Name Link
🔨 Latest commit 463209e
🔍 Latest deploy log https://app.netlify.com/projects/dranet/deploys/6a0b9cabb751ce0008db6c83

@k8s-ci-robot k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels May 18, 2026
@anson627 anson627 changed the title Add inference example Add NIXL kv cache transfer example May 18, 2026
@gauravkghildiyal

Copy link
Copy Markdown
Member

/assign

@gauravkghildiyal

Copy link
Copy Markdown
Member

Very cool!

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 20, 2026
@k8s-ci-robot

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: anson627, gauravkghildiyal

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 20, 2026
@gauravkghildiyal

Copy link
Copy Markdown
Member

/label tide/merge-method-squash

@k8s-ci-robot k8s-ci-robot added the tide/merge-method-squash Denotes a PR that should be squashed by tide when it merges. label May 20, 2026
@k8s-ci-robot k8s-ci-robot merged commit c43ca7e into kubernetes-sigs:main May 20, 2026
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/documentation Categorizes issue or PR as related to documentation. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. tide/merge-method-squash Denotes a PR that should be squashed by tide when it merges.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants