[Proposal] Gate-Controlled Scheduling for Cluster Autoscalers Compatibility #4727
devzizu wants to merge 26 commits into volcano-sh:master
Conversation
Welcome @devzizu! It looks like this is your first PR to volcano-sh/volcano 🎉
@kingeasternsun @hajnalmt Do you have time to help @devzizu take a look at this proposal?

/cc @hajnalmt

Sure, I am starting to review it 👍
Thank you for this great proposal @devzizu.
I am looking forward to the implementation, great idea and solid design change.
I had mostly minor remarks except the capacity plugin change.
What I see is that, after this change, pods that Volcano marks as scheduling-gated via the annotation could arguably be considered Inqueue, but treating them outright as Allocated seems too significant a change.
Wouldn't it be sufficient to adjust the `DeductSchGatedResources` function so that tasks that are only Volcano scheduling-gated are not deducted when `hasOnlyVolcanoSchedulingGate` returns true? This approach would also maintain compatibility with the overcommit plugin, which I'm slightly concerned about since it heavily depends on scheduling-gate behavior.
I want to remark that this would mean these annotated pods would move to Inqueue and would no longer be in scope of the enqueue action once every other scheduling gate is removed, which is probably what we want.
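The suggested check could be sketched as follows. This is a minimal runnable illustration, not the actual Volcano code: the gate name and the pod types are simplified stand-ins (a real implementation would use `k8s.io/api/core/v1` types and Volcano's actual gate constant).

```go
package main

import "fmt"

// volcanoSchedulingGate is an assumed name for the Volcano-managed gate.
const volcanoSchedulingGate = "volcano.sh/queue-capacity"

// PodSchedulingGate and PodSpec are simplified stand-ins for the
// corresponding k8s.io/api/core/v1 types.
type PodSchedulingGate struct{ Name string }

type PodSpec struct{ SchedulingGates []PodSchedulingGate }

// hasOnlyVolcanoSchedulingGate reports whether every gate on the pod is the
// Volcano-managed one, i.e. the pod is gated solely by Volcano's queue logic
// and its resources should not be deducted as scheduling-gated.
func hasOnlyVolcanoSchedulingGate(spec PodSpec) bool {
	if len(spec.SchedulingGates) == 0 {
		return false
	}
	for _, g := range spec.SchedulingGates {
		if g.Name != volcanoSchedulingGate {
			return false
		}
	}
	return true
}

func main() {
	onlyVolcano := PodSpec{SchedulingGates: []PodSchedulingGate{
		{Name: volcanoSchedulingGate},
	}}
	mixed := PodSpec{SchedulingGates: []PodSchedulingGate{
		{Name: volcanoSchedulingGate},
		{Name: "example.com/other-gate"},
	}}
	fmt.Println(hasOnlyVolcanoSchedulingGate(onlyVolcano))
	fmt.Println(hasOnlyVolcanoSchedulingGate(mixed))
}
```

With such a predicate, `DeductSchGatedResources` could simply skip tasks for which it returns true, leaving pods gated by third-party controllers deducted as before.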
Thank you for the contribution and the idea once more!
hajnalmt left a comment
We are getting there! Keep up the great work 😊
[APPROVALNOTIFIER] This PR is NOT APPROVED. The full list of commands accepted by this bot can be found here. Needs approval from an approver in each of these files. Approvers can indicate their approval by writing `/approve` in a comment.
Great Work!
Please squash your commits, and sign the DCO! I think this is quite ready.
Do you need any help with the implementation? We will need to add an e2e test for this too.
I think that should happen in a different PR though, as the design is quite big.
/cc @JesseStutler I think we can pull this into release-1.15
Signed-off-by: devzizu <jazevedo960@gmail.com>
Co-authored-by: João Soares <36493199+jmgsoares@users.noreply.github.com>
Pull request overview
This PR introduces a comprehensive design proposal for addressing a critical compatibility issue between Volcano scheduler and cluster autoscalers (CA/Karpenter). The proposal leverages Kubernetes scheduling gates to prevent unnecessary cluster scale-ups when pods are waiting for queue capacity rather than node capacity.
Changes:
- Adds a detailed design document proposing an opt-in feature using scheduling gates to differentiate between queue capacity constraints and cluster capacity constraints
- Introduces queue reservation mechanism to prevent race conditions when ungated pods remain unscheduled
- Proposes extensions to the admission webhook, scheduler actions, and capacity plugin to implement the gate-controlled scheduling
```go
type capacityPlugin struct {
	// ... existing fields ...

	// queueGateReservedTasks tracks tasks that passed capacity checks but cannot be scheduled.
	// These tasks reserve queue capacity to prevent other tasks from consuming it.
	// Rebuilt fresh at the start of each scheduling cycle in OnSessionOpen.
	queueGateReservedTasks map[api.QueueID]map[api.TaskID]*api.TaskInfo
}
```

###### Capacity Check with Reserved Resources

The `queueAllocatableWithReserved` method performs capacity checks including reserved resources:

```go
func (cp *capacityPlugin) queueAllocatableWithReserved(attr *queueAttr, candidate *api.TaskInfo, queue *api.QueueInfo) bool {
```
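The check the proposal describes might look roughly like the following runnable sketch. The `Resource` and `queueAttr` types here are simplified stand-ins for Volcano's `pkg/scheduler/api` types, and the field names (`realCapability`, `allocated`, `reserved`) are assumptions based on the design text.

```go
package main

import "fmt"

// Resource is a simplified stand-in for Volcano's api.Resource.
type Resource struct{ MilliCPU, Memory float64 }

// Add returns the element-wise sum of two resource vectors.
func (r Resource) Add(o Resource) Resource {
	return Resource{r.MilliCPU + o.MilliCPU, r.Memory + o.Memory}
}

// LessEqual reports whether r fits within limit in every dimension.
func (r Resource) LessEqual(limit Resource) bool {
	return r.MilliCPU <= limit.MilliCPU && r.Memory <= limit.Memory
}

// queueAttr is a simplified stand-in for the capacity plugin's queue attributes.
type queueAttr struct {
	realCapability Resource // queue capacity
	allocated      Resource // resources of already-allocated tasks
	reserved       Resource // resources held by gate-reserved (ungated but unscheduled) tasks
}

// queueAllocatableWithReserved checks whether candidate fits into the queue
// once both allocated and reserved resources are counted against capacity.
func queueAllocatableWithReserved(attr queueAttr, candidate Resource) bool {
	used := attr.allocated.Add(attr.reserved).Add(candidate)
	return used.LessEqual(attr.realCapability)
}

func main() {
	attr := queueAttr{
		realCapability: Resource{MilliCPU: 4000, Memory: 8192},
		allocated:      Resource{MilliCPU: 2000, Memory: 4096},
		reserved:       Resource{MilliCPU: 1000, Memory: 2048},
	}
	fmt.Println(queueAllocatableWithReserved(attr, Resource{MilliCPU: 500, Memory: 1024}))  // fits within remaining capacity
	fmt.Println(queueAllocatableWithReserved(attr, Resource{MilliCPU: 2000, Memory: 1024})) // exceeds remaining CPU
}
```

The key point is that `reserved` is added on top of `allocated` before comparing against capacity, so ungated-but-unscheduled tasks keep holding their share of the queue.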
The reserved cache accounting could lead to resource starvation in certain scenarios. Consider: if pod-A fails to schedule due to node constraints and gets its gate removed, it reserves queue capacity. If new pods (pod-B, pod-C) are created that could actually schedule, they will be blocked by pod-A's reservation even though pod-A can't make progress until the autoscaler provisions new nodes. The proposal should discuss whether there should be a timeout or mechanism to release reservations for pods that have been ungated but haven't made progress for an extended period, preventing indefinite queue capacity blocking.
This is actually intended design, but the concern is a valid point: an ungated-but-unschedulable pod can hold queue capacity and block other pods that could schedule. We could add a timeout or similar mechanism to release reservations for pods that stay unschedulable for a long time, but that would need a clear policy and might over-commit the queue when those pods eventually get nodes.
@JesseStutler @hajnalmt – Do you think we should address this in the initial implementation (e.g. document the tradeoff and/or add a reservation timeout), or treat it as follow-up work once we see how it behaves in practice?
I think this is fine and this is the "standard" way of things happening. I am fine with it.
What type of PR is this?
/kind documentation
/kind feature
What this PR does / why we need it:
This design proposal addresses an issue where Volcano incorrectly signals cluster autoscalers (e.g., CA or Karpenter) to scale up nodes even when pods are only waiting for queue capacity, not cluster resources. Currently, Volcano marks all unallocated pods as `Unschedulable` regardless of the reason, causing autoscalers to interpret queue constraints as insufficient node capacity.

This proposal introduces an opt-in feature using Kubernetes `schedulingGates` to hide queue-constrained pods from autoscalers, ensuring scale-up operations only trigger for legitimate node-fit failures. This aims to prevent unnecessary infrastructure costs and resource waste.

Which issue(s) this PR fixes:
Fixes #4710