Pull requests: llm-d/llm-d-inference-scheduler

Pull requests list

feat: add approximate scoring mode with recency-weighted scoring
#808 opened Apr 9, 2026 by bongwoobak Contributor
(2/2) Import igw@v1.5.0-rc.1
#807 opened Apr 8, 2026 by zetxqx Contributor
Script to automate migration for GAIE code into llm-d
#804 opened Apr 8, 2026 by elevran Collaborator
migrate: import pkg/epp from gateway-api-inference-extension as pkg/epp/igw (labels: do-not-merge/hold, hold)
#803 opened Apr 8, 2026 by elevran Collaborator Draft
[feat] [cicd] Use distroless/static as default runtime base image
#795 opened Apr 2, 2026 by elevran Collaborator
feat: add disaggregation decider evaluation metrics
#793 opened Apr 1, 2026 by wenhug
4 tasks done
[cicd] Add changelog action
#791 opened Apr 1, 2026 by elevran Collaborator
Add support for s390x platform in Docker build (label: do-not-merge/hold)
#763 opened Mar 25, 2026 by satyamg1620
[cicd] Compare coverage against main and latest release-*
#757 opened Mar 25, 2026 by elevran Collaborator
perf: optimize sidecar proxy hot path for high-concurrency P/D routing
#746 opened Mar 21, 2026 by tlrmchlsmth Member
2 of 3 tasks
Basic implementation of dynamic LoRA adapters placement, based on shuffle sharding algorithm (label: do-not-merge/work-in-progress)
#720 opened Mar 15, 2026 by dmitripikus Contributor Draft
Add program aware plugin
#707 opened Mar 11, 2026 by praveingk Draft
Adds configuration file for sidecar proxy
#683 opened Mar 6, 2026 by DhritiShikhar
Enable chunked decode in the routing proxy
#603 opened Feb 9, 2026 by andreyod Contributor Draft
add cookie-based affinity support (label: lgtm)
#600 opened Feb 5, 2026 by roytman Collaborator