-
Notifications
You must be signed in to change notification settings - Fork 20
Add pool topology, unified gate interface, labels, and tier-priority dispatch #186
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
joeltg
wants to merge
14
commits into
llm-d-incubation:main
Choose a base branch
from
joeltg:feat/tier-priority
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Changes from all commits
Commits
Show all changes
14 commits
Select commit
Hold shift + click to select a range
da0399f
feat: add Labels/Pool/PoolDispatch framework, RMP registry, per-pool …
joeltg 940fe74
feat: replace DispatchGate with Verdict/Gate interface; clean up old …
joeltg e64ab63
feat: wire per-pool gate chains and subscription gate chains in pubsu…
joeltg e5510ad
docs: update README and remove stale dispatch-budget docs
joeltg 11fd6ac
feat: add tier-priority RMP and tier-priority-admission pool gate (PR 4)
joeltg d1e9dcc
refactor: remove Labels from ResultMessage; fix PR description
joeltg e93c084
fix: align cherry-picks with validated fork state
joeltg 54c26fd
fix: P0 ack leak in subscription-gate Terminate; P1 err-vs-Verdict co…
joeltg d69dcca
fix: tierpriority catch-all, helm dead flags, SS gating cleanup, pubs…
joeltg 31021ce
Merge upstream main (PR #183) — discard classifier-mode-on-quota-gate
joeltg d6caede
fix: bound gate blocking by transport lease; require publish before a…
joeltg 458bba3
fix: bound tier-priority RMP per-subscription backlog
joeltg 78f4d76
fix: enforce publish-before-ack for subscription-gate Drop(result)
joeltg 243aa1f
fix: redis sorted-set gate parity + closed-channel safety
joeltg File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Worker goroutines fire-and-forget: Workers are spawned with bare
go asyncworker.Worker(...)and nothing waits for them to finish — the old WaitGroup was removed. After<-ctx.Done()returns (line 224),main()exits immediately, so in-flight requests are abandoned mid-dispatch. This can lose results or leave partial state in Redis.Add a
sync.WaitGroup(or similar) so the shutdown path waits for all workers to drain before exiting.