Feature Area
Other
Problem Statement
While a vLLM instance starts up, the dual-pods controller polls /is_sleeping through the work queue's exponential backoff (5ms growing to 20s). Early retries are wastefully frequent; later ones add unnecessary latency. A fixed ~5s interval would be more appropriate.
From #443 (comment) (point 4).
Proposed Solution
TBD
Alternatives Considered
No response
Willingness to Contribute
Yes, I can submit a PR
Additional Context
No response