
Handling Attention Group id in KV events #510

Open

kapiljain1989 wants to merge 1 commit into llm-d:main from kapiljain1989:groupid

Conversation

@kapiljain1989

Add attention group tracking to the KV-Cache indexer for Hybrid Multi-head Attention (HMA) support. This enables per-group cache-hit scoring for models with multiple attention groups (e.g., full attention + sliding-window attention).

Changes

  • Core data model: Add StoredGroups []int field to PodEntry to track which attention groups have cached a block
  • Event schema: Add GroupIdx field to BlockStoredEvent and BlockRemovedEvent for per-group cache updates
  • vLLM adapter: Parse group_idx from vLLM KV events (msgpack fields [9] and [3])
  • Index implementations: Update all index backends (InMemory, CostAwareMemory, Redis) to:
    • Use string-based cache keys ("podID@tier") instead of struct keys for efficient in-place updates
    • Merge StoredGroups when adding duplicate entries
    • Remove specific groups on eviction (delete entry only when no groups remain)
    • Store JSON-serialized entries in Redis for group list persistence
  • Event processing: Convert single GroupIdx from events to StoredGroups list in index operations
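The merge-on-store and delete-when-empty behavior described above can be sketched in Go. The field names `PodEntry`, `StoredGroups`, and the `"podID@tier"` key format come from the PR text; the helper functions and their signatures are hypothetical illustrations, not the actual implementation.

```go
package main

import (
	"fmt"
	"slices"
)

// PodEntry sketches the data model described above: which pod/tier holds a
// cached block and which attention groups have stored it.
type PodEntry struct {
	PodID        string
	Tier         string
	StoredGroups []int
}

// cacheKey builds the string-based "podID@tier" key the PR switches to,
// replacing struct keys so entries can be updated in place.
func cacheKey(podID, tier string) string {
	return podID + "@" + tier
}

// mergeGroups merges StoredGroups when a duplicate entry is added,
// deduplicating so repeated BlockStored events are idempotent.
func mergeGroups(entry *PodEntry, groups []int) {
	for _, g := range groups {
		if !slices.Contains(entry.StoredGroups, g) {
			entry.StoredGroups = append(entry.StoredGroups, g)
		}
	}
}

// removeGroup drops one group on eviction and reports whether the entry
// is now empty, i.e. should be deleted from the index entirely.
func removeGroup(entry *PodEntry, group int) bool {
	entry.StoredGroups = slices.DeleteFunc(entry.StoredGroups, func(g int) bool {
		return g == group
	})
	return len(entry.StoredGroups) == 0
}

func main() {
	e := &PodEntry{PodID: "pod-a", Tier: "gpu", StoredGroups: []int{0}}
	mergeGroups(e, []int{0, 1}) // duplicate group 0 is ignored
	fmt.Println(cacheKey(e.PodID, e.Tier), e.StoredGroups)
	// prints: pod-a@gpu [0 1]
	fmt.Println(removeGroup(e, 0), removeGroup(e, 1))
	// prints: false true — deletable only after the last group is evicted
}
```

Keeping eviction per-group rather than per-entry is what prevents a sliding-window group's eviction from invalidating a block that the full-attention group still holds.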

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

@github-actions github-actions bot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Apr 10, 2026
Signed-off-by: Kapil Jain <kapiljain1989@gmail.com>
