-
Notifications
You must be signed in to change notification settings - Fork 132
Pull requests: llm-d/llm-d-kv-cache
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
PVC Evictor: add crawler tests, CI, and docs after layout changes
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#618
opened May 29, 2026 by
guygir
Collaborator
Loading…
feat(kvevents): parse HMA KV event metadata
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#612
opened May 26, 2026 by
sagearc
Collaborator
Loading…
feat: batch KV block copies via cudaMemcpyBatchAsync in fs connector
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#607
opened May 26, 2026 by
kfirtoledo
Collaborator
Loading…
2 tasks done
deps(go): bump the go-dependencies group across 1 directory with 13 updates
dependencies
Pull requests that update a dependency file
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#606
opened May 26, 2026 by
dependabot
Bot
Loading…
feat: Add PVC evictor BlockRemoved events
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#605
opened May 25, 2026 by
albertoperdomo2
Contributor
Loading…
fix(sglang): decode token_ids as typed []uint32 with bigram support
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#603
opened May 23, 2026 by
ryanx-sir
Loading…
4 tasks done
Better tmp file handling (#2)
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#600
opened May 20, 2026 by
Prgrmman
Loading…
fix: remove MIN_STAGING_BUFFER_SIZE that inflates KV cache on disk
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#589
opened May 17, 2026 by
Jwrede
Loading…
1 of 2 tasks
Add SHA256-CBOR hashing algorithm for token processor with extra keys…
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#587
opened May 15, 2026 by
leipanhz
Loading…
ci: Wire fs_backend Python tests into CI
lgtm
Looks good to me, indicates that a PR is ready to be merged.
size/M
Denotes a PR that changes 30-99 lines, ignoring generated files.
#578
opened May 7, 2026 by
albertoperdomo2
Contributor
Loading…
build: pin protoc for protobuf generation
size/S
Denotes a PR that changes 10-29 lines, ignoring generated files.
#576
opened May 6, 2026 by
yankay
Collaborator
Loading…
fix: avoid LRU promotion on Lookup to unblock concurrent readers
lifecycle/stale
size/S
Denotes a PR that changes 10-29 lines, ignoring generated files.
Add redis lookup and improve nixl lookup
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#552
opened Apr 28, 2026 by
effi-ofer
Contributor
Loading…
feat: Add RenderBatchCompletion RPC for multi-prompt tokenization
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#538
opened Apr 24, 2026 by
albertoperdomo2
Contributor
Loading…
Add Hybrid Multi-head Attention (HMA) support for KV-Cache scoring
lifecycle/rotten
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#533
opened Apr 19, 2026 by
kapiljain1989
Loading…
feat: Add HMA support to FS connector
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#476
opened Mar 29, 2026 by
kfirtoledo
Collaborator
Loading…
4 tasks done
deps(go): bump google.golang.org/grpc from 1.77.0 to 1.79.3
dependencies
Pull requests that update a dependency file
lifecycle/rotten
#438
opened Mar 19, 2026 by
dependabot
Bot
Loading…
feat:add support to invalidate KV cache via AllBlocksCleared event
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#437
opened Mar 18, 2026 by
yash9263
Loading…
Add DP-aware routing support to KVEvents and indexing pipeline
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#370
opened Feb 28, 2026 by
satyamg1620
Loading…
ProTip!
Follow long discussions with comments:>50.