update: change kv cache to meta tensor until device tensor is fully supported by rebel-jonghewk · Pull Request #306 · RBLN-SW/vllm-rbln

rebel-jonghewk · 2026-01-29T01:24:14Z

🚀 Summary of Changes

What does this PR do? What feature, fix, or improvement does it bring?

While device tensor (rbln) support in vLLM is under development, this PR replaces the CPU-based KV cache with meta-device tensors to prevent unnecessary host memory usage.

This requires the latest rebel-compiler to work.
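
To make the difference between a CPU-backed and a meta-device allocation concrete, here is a minimal PyTorch sketch. It is not the code from this PR: the cache shape, the dtype, and the `logical_nbytes` helper are hypothetical and chosen only for illustration. It shows that a tensor created on the `meta` device keeps the shape and dtype metadata without any backing data.

```python
import torch

# Hypothetical KV cache layout, for illustration only:
# (key/value, num_blocks, block_size, num_kv_heads, head_dim)
kv_cache_shape = (2, 128, 16, 8, 64)
dtype = torch.float16

# CPU allocation: the host reserves real memory for every element.
cpu_cache = torch.empty(kv_cache_shape, dtype=dtype, device="cpu")

# Meta allocation: only shape, dtype, and strides are recorded.
# The tensor has no data, so it cannot be read from or written to.
meta_cache = torch.empty(kv_cache_shape, dtype=dtype, device="meta")


def logical_nbytes(t: torch.Tensor) -> int:
    """Bytes the tensor would occupy if it were actually materialized."""
    return t.numel() * t.element_size()


print(meta_cache.is_meta)                    # True
print(meta_cache.shape == cpu_cache.shape)   # True
print(logical_nbytes(meta_cache))            # 4194304 (4 MiB that is never allocated on the host)
```

Because a meta tensor carries no data, this kind of placeholder is only useful when the real cache contents live elsewhere, for example on the accelerator, and the framework needs just the shape and dtype on the host side.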

📌 Related Issues / Tickets

Resolves #
Related to #

✅ Type of Change

✨ Feature (feature)
🧠 Model support (model)
🧬 Core engine changes (core)
🛠 Bug fix (bug-fix)
⚙️ Performance improvement (perf)
🔁 Refactor or code cleanup (refactor)
📄 Documentation (docs)
❓ Other (other): please describe

🧪 How to Test

Run ...
Verify output: ...
Edge case tested: ...

📸 Screenshots / Logs (if applicable)

📋 Checklist

PR title follows Conventional Commits format
This PR is linked to an existing issue
The test method is described, and the expected result is clearly stated
Relevant documentation has been updated (if applicable)

💬 Notes

rebel-eunji

👍

This reverts commit af04acc.

change kv cahce to meta tensor

4a03949

rebel-jonghewk requested review from rebel-eunji, rebel-jaehwang, rebel-jiwoopark, rebel-jongho, rebel-myeongbo and rebel-wonsubkim January 29, 2026 01:24

rebel-eunji approved these changes Jan 29, 2026

rebel-jonghewk changed the title ~~update: change kv cahce to meta tensor until device tensor is fully supported~~ update: change kv cache to meta tensor until device tensor is fully supported Jan 29, 2026

rebel-jongho approved these changes Jan 30, 2026

rebel-jonghewk merged commit af04acc into dev-0.12 Jan 30, 2026
1 check passed

rebel-jonghewk deleted the kv-cache-meta branch January 30, 2026 03:48

rebel-jaehwang pushed a commit that referenced this pull request Jan 30, 2026

change kv cahce to meta tensor (#306)

77a6a57

rebel-jaehwang pushed a commit that referenced this pull request Jan 30, 2026

change kv cahce to meta tensor (#306)

fbc6923

rebel-jaehwang pushed a commit that referenced this pull request Jan 30, 2026

change kv cahce to meta tensor (#306)

3770a07

rebel-wonsubkim added a commit that referenced this pull request Feb 2, 2026

Revert "change kv cahce to meta tensor (#306)"

0e86be8

This reverts commit af04acc.

rebel-wonsubkim added a commit that referenced this pull request Feb 2, 2026

Revert "change kv cahce to meta tensor (#306)" (#321)

c871cb2

This reverts commit af04acc.

rebel-jaehwang pushed a commit that referenced this pull request Feb 2, 2026

Revert "change kv cahce to meta tensor (#306)" (#321)

bcac89f

This reverts commit af04acc.

rebel-jinhwan mentioned this pull request Feb 2, 2026

feat(worker): chunked pipeline parallelism (CPP) with async P2P send rebel-jinhwan/vllm-rbln#6

Draft

rebel-jiwoopark pushed a commit that referenced this pull request Feb 4, 2026

change kv cahce to meta tensor (#306)

8555d77

rebel-jiwoopark pushed a commit that referenced this pull request Feb 4, 2026

Revert "change kv cahce to meta tensor (#306)" (#321)

e2fff67

This reverts commit af04acc.

rebel-jonghewk merged 1 commit into dev-0.12 from kv-cache-meta

3 participants
