Skip to content

update: change kv cache to meta tensor until device tensor is fully supported#306

Merged
rebel-jonghewk merged 1 commit intodev-0.12from
kv-cache-meta
Jan 30, 2026
Merged

update: change kv cache to meta tensor until device tensor is fully supported#306
rebel-jonghewk merged 1 commit intodev-0.12from
kv-cache-meta

Conversation

@rebel-jonghewk
Copy link
Copy Markdown
Collaborator

🚀 Summary of Changes

What does this PR do? What feature, fix, or improvement does it bring?

While device tensor (rbln) support in vLLM is under development, this PR replaces the CPU-based KV cache with meta-device tensors to prevent unnecessary host memory usage.

This will require latest rebel-compiler to work.


📌 Related Issues / Tickets

  • Resolves #
  • Related to #

✅ Type of Change

  • ✨ Feature (feature)
  • 🧠 Model support (model)
  • 🧬 Core engine changes (core)
  • 🛠 Bug fix (bug-fix)
  • ⚙️ Performance improvement (perf)
  • 🔁 Refactor or code cleanup (refactor)
  • 📄 Documentation (docs)
  • ❓ Other (other): please describe

🧪 How to Test

  1. Run ...
  2. Verify output: ...
  3. Edge case tested: ...

📸 Screenshots / Logs (if applicable)


📋 Checklist

  • PR title follows Conventional Commits format
  • This PR is linked to an existing issue
  • The test method is described, and the expected result is clearly stated
  • Relevant documentation has been updated (if applicable)

💬 Notes


Copy link
Copy Markdown
Collaborator

@rebel-eunji rebel-eunji left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@rebel-jonghewk rebel-jonghewk changed the title update: change kv cahce to meta tensor until device tensor is fully supported update: change kv cache to meta tensor until device tensor is fully supported Jan 29, 2026
@rebel-jonghewk rebel-jonghewk merged commit af04acc into dev-0.12 Jan 30, 2026
1 check passed
@rebel-jonghewk rebel-jonghewk deleted the kv-cache-meta branch January 30, 2026 03:48
rebel-jaehwang pushed a commit that referenced this pull request Jan 30, 2026
rebel-jaehwang pushed a commit that referenced this pull request Jan 30, 2026
rebel-jaehwang pushed a commit that referenced this pull request Jan 30, 2026
rebel-wonsubkim added a commit that referenced this pull request Feb 2, 2026
rebel-wonsubkim added a commit that referenced this pull request Feb 2, 2026
rebel-jaehwang pushed a commit that referenced this pull request Feb 2, 2026
rebel-jiwoopark pushed a commit that referenced this pull request Feb 4, 2026
rebel-jiwoopark pushed a commit that referenced this pull request Feb 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants