
fix: consider kv cache 64 align to estimate kv cache memory size #417

Open

rebel-wonsubkim wants to merge 1 commit into dev from fix_cache_64align

Conversation

Contributor

@rebel-wonsubkim rebel-wonsubkim commented Feb 27, 2026

Problem: during the OPT model test, a KV cache OOM occurred.

Solution: when calculating the memory available for the KV cache, the device's 64 memory alignment for KV cache entries should be taken into account. If head_size is not aligned to 64 (= 128 B), the memory available for the KV cache should be adjusted accordingly.

+ Consider the device's 64B memory alignment for the KV cache
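The adjustment described above can be sketched as follows. This is an illustrative example only, not the actual PR diff: all names (`KV_ALIGN`, `round_up`, `kv_cache_bytes_per_token`) and the assumption that alignment applies to the head dimension in elements (64 fp16 elements = 128 B) are hypothetical.

```python
# Hypothetical sketch of the fix; names and structure are illustrative,
# not taken from the PR diff.

KV_ALIGN = 64  # assumed: device pads the head dimension to 64 elements


def round_up(x: int, align: int) -> int:
    """Round x up to the nearest multiple of align."""
    return (x + align - 1) // align * align


def kv_cache_bytes_per_token(num_layers: int, num_kv_heads: int,
                             head_size: int, dtype_bytes: int) -> int:
    """Per-token KV cache footprint with head_size padded to the alignment.

    The factor of 2 accounts for storing both keys and values.
    """
    padded_head_size = round_up(head_size, KV_ALIGN)
    return 2 * num_layers * num_kv_heads * padded_head_size * dtype_bytes


# A head_size of 80 pads to 128 elements, a 60% larger footprint than a
# naive estimate that ignores alignment would predict -- enough to cause
# an OOM when the naive estimate is used to size the cache.
naive = 2 * 12 * 12 * 80 * 2                          # no padding
padded = kv_cache_bytes_per_token(12, 12, 80, 2)      # with padding
```

With such a helper, the available-memory calculation can divide by the padded per-token size instead of the unpadded one, so the estimated cache capacity never exceeds what the device can actually hold.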

Signed-off-by: wonsub kim <subang0@rebellions.ai>
@rebel-wonsubkim rebel-wonsubkim changed the base branch from main to dev February 27, 2026 01:16
@rebel-wonsubkim rebel-wonsubkim changed the title from "fix : consider kv cache 64 align to estimate kv cache memory size" to "fix: consider kv cache 64 align to estimate kv cache memory size" Feb 27, 2026
Collaborator

@rebel-jiwoopark rebel-jiwoopark left a comment


lgtm

@rebel-jiwoopark rebel-jiwoopark added the torch.compile torch.compile based implementation label Mar 11, 2026
@rebel-jiwoopark
Collaborator

@rebel-wonsubkim If there are no problems, we can go ahead and merge this.


3 participants