feat:add support to invalidate KV cache via AllBlocksCleared event by yash9263 · Pull Request #437 · llm-d/llm-d-kv-cache

yash9263 · 2026-03-18T19:50:54Z

Issue: #396

This PR adds support to Invalidate KV cache via the AllBlocksCleared event.

Updated event processing in vllm_adapter.go and pool.go to parse and handle the AllBlocksCleared event.
Added Clear method to the Index interface and implemented it for all the index types (CostAwareMemoryIndex, InMemoryIndex, RedisIndex).
Introduced podToRequestKeys reverse index: podIdentifier -> []engineKeyToRequestKeyMapping for all the indexes.
- Clear: Looks up only the request keys for the target pod via podToRequestKeys
- Iterates those keys, and removes matching entries, prunes empty hashes + their engine-key mappings
- Updates the reverse index: drops the pod entirely if DeviceTier is empty, or trims to still-alive request keys if a specific tier was targeted

Testing

pkg/kvcache/kvblock/index_test.go

Examples:

Updated the kv_events/offline and valkey_example to simulate the Clear event.

github-actions · 2026-03-18T19:51:03Z

Unsigned commits detected! Please sign your commits.

For instructions on how to set up GPG/SSH signing and verify your commits, please see GitHub Documentation.

…nvalidate-kv-cache

vMaroon · 2026-03-19T12:24:01Z

Hi @yash9263 - thank you for the contribution. One detail that may have not been clear from the issue: an event coming from a certain endpoint should only clear the latter's entries.

E.g., if the index state maps pod A having blocks [x, y, z], and pod B having blocks [x, y], then an AllBlocksCleared event from pod A should remove the mappings of [x, y] -> pod A while keeping pod B, and completely remove z since now it maps to no one.

yash9263 · 2026-03-19T13:17:20Z

E.g., if the index state maps pod A having blocks [x, y, z], and pod B having blocks [x, y], then an AllBlocksCleared event from pod A should remove the mappings of [x, y] -> pod A while keeping pod B, and completely remove z since now it maps to no one.

Hi @vMaroon, apologies for the oversight and not clarifying this earlier.

Given the indexes are structured engineKey -> requestKey -> podIdentifier, to remove all the blocks associated with a pod, it will require traversing each block to remove the pod mappings and remove the engineKey if the requestKey is empty or removed. For the Redis index, it may require multiple round-trips.

Should we also consider deviceTier while evicting blocks for the given podIdentifier here as well? Since it was part of the original event struct:

only remove blocks for the specified device tier
Remove blocks across gpu and cpu, if no device tier is provided?

Would it be viable to introduce a reverse index pod -> engineKey to make evicting all blocks for a pod more efficient?
Or I can proceed with the O(n) scan and evict approach.

…nvalidate-kv-cache

… for pod-to-request key mappings Signed-off-by: yashwant <yashwant8530@gmail.com>

…nvalidate-kv-cache

yash9263 · 2026-03-30T10:40:08Z

@vMaroon, ready for review.

…lidate-kv-cache

pkg/kvcache/kvblock/in_memory.go

pkg/kvcache/kvblock/cost_aware_memory.go

pkg/kvcache/kvblock/in_memory.go

pkg/kvcache/kvblock/cost_aware_memory.go

Signed-off-by: yashwant <yashwant8530@gmail.com>

pkg/kvcache/kvblock/in_memory.go

pkg/kvcache/kvblock/index_test.go

…consistencies

gyliu513 · 2026-04-02T14:56:15Z

@yash9263 I think you need rebase your PR to latest code to resolve conflict?

…lidate-kv-cache

gyliu513

Good progress, thanks @yash9263 !

pkg/kvcache/kvblock/in_memory.go

pkg/kvevents/pool.go

pkg/kvcache/kvblock/cost_aware_memory.go

pkg/kvcache/kvblock/redis.go

pkg/kvcache/kvblock/in_memory.go

pkg/kvcache/kvblock/redis.go

pkg/kvcache/kvblock/cost_aware_memory.go

gyliu513 · 2026-04-03T21:47:47Z

@vMaroon @sagearc @yankay I think this is in good shape, it is great if you guys can check if this can be merged, thanks!

…lidate-kv-cache

gyliu513

/lgtm

Thanks @yash9263 !

pkg/kvcache/kvblock/cost_aware_memory.go

yash9263 requested review from dannyharnik, kfirtoledo, liu-cong and vMaroon as code owners March 18, 2026 19:50

github-actions bot requested review from hyeongyun0916, sagearc and yankay March 18, 2026 19:51

yash9263 force-pushed the feat-invalidate-kv-cache branch from 61fa5d4 to f1691b7 Compare March 19, 2026 07:07

feat:add support to invalidate KV cache via AllBlocksCleared event

ef9c002

yash9263 force-pushed the feat-invalidate-kv-cache branch from f1691b7 to ef9c002 Compare March 19, 2026 07:15

Merge branch 'main' of github.com:yash9263/llm-d-kv-cache into feat-i…

02bff24

…nvalidate-kv-cache

yash9263 marked this pull request as draft March 20, 2026 19:46

yash9263 added 3 commits March 21, 2026 16:16

Merge branch 'main' of github.com:yash9263/llm-d-kv-cache into feat-i…

3e8a2d1

…nvalidate-kv-cache

feat: update Clear to clear by PodEntry and introduce a reverse index…

f617650

… for pod-to-request key mappings Signed-off-by: yashwant <yashwant8530@gmail.com>

Merge branch 'main' of github.com:yash9263/llm-d-kv-cache into feat-i…

be894f6

…nvalidate-kv-cache

yash9263 force-pushed the feat-invalidate-kv-cache branch from 5215b68 to be894f6 Compare March 24, 2026 19:49

yash9263 marked this pull request as ready for review March 24, 2026 19:55

Merge branch 'main' of github.com:yash9263/llm-d-kv-cache into feat-i…

5cdb358

…nvalidate-kv-cache

Merge branch 'main' of github.com:llm-d/llm-d-kv-cache into feat-inva…

52653b4

…lidate-kv-cache

github-actions bot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label Mar 31, 2026

gyliu513 reviewed Apr 1, 2026

View reviewed changes

update engineKeyToRequest from struct to map & update tests

a6a6b2d

Signed-off-by: yashwant <yashwant8530@gmail.com>

yash9263 requested a review from gyliu513 April 1, 2026 20:10

gyliu513 reviewed Apr 2, 2026

View reviewed changes

pkg/kvcache/kvblock/in_memory.go Outdated Show resolved Hide resolved

pkg/kvcache/kvblock/index_test.go Outdated Show resolved Hide resolved

fix concurrent reverse index updates in InMemoryIndex & fix naming in…

d9074f1

…consistencies

yash9263 requested a review from gyliu513 April 2, 2026 12:38

Merge branch 'main' of github.com:llm-d/llm-d-kv-cache into feat-inva…

f7bac8b

…lidate-kv-cache

gyliu513 reviewed Apr 2, 2026

View reviewed changes

fix deadlock in clear

1fe7069

yash9263 requested a review from gyliu513 April 3, 2026 16:19

gyliu513 reviewed Apr 3, 2026

View reviewed changes

pkg/kvcache/kvblock/in_memory.go Outdated Show resolved Hide resolved

pkg/kvcache/kvblock/redis.go Outdated Show resolved Hide resolved

pkg/kvcache/kvblock/cost_aware_memory.go Show resolved Hide resolved

fix multiple round-trips in redis index Clear

88aa64d

yash9263 requested a review from gyliu513 April 4, 2026 17:57

Merge branch 'main' of github.com:llm-d/llm-d-kv-cache into feat-inva…

1c69e50

…lidate-kv-cache

gyliu513 reviewed Apr 7, 2026

View reviewed changes

pkg/kvcache/kvblock/cost_aware_memory.go Outdated Show resolved Hide resolved

refactor: replace if-else block with switch statement

d475bb9

Conversation

yash9263 commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 18, 2026

Uh oh!

vMaroon commented Mar 19, 2026

Uh oh!

yash9263 commented Mar 19, 2026

Uh oh!

yash9263 commented Mar 30, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gyliu513 commented Apr 2, 2026

Uh oh!

gyliu513 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gyliu513 commented Apr 3, 2026

Uh oh!

gyliu513 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yash9263 commented Mar 18, 2026 •

edited

Loading