Add docs for in-place updates #7999

omerap12 · 2025-04-01T06:57:26Z

What type of PR is this?

/kind documentation

What this PR does / why we need it:

Add docs for in-place updates.

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

Add docs for in-place updates

Depends on #7673
/hold

This change ensures that when DecreaseTargetSize counts the nodes, it does not include any instances that are pending (i.e. have no node ref), deleting, or failed. This allows the core autoscaler to decrease the size of the node group accordingly instead of raising an error. This change also adds some code to the unit tests to make this condition easier to detect.
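
The counting rule described in the commit message above can be sketched roughly as follows. This is a minimal illustration in Python, not the clusterapi provider's code: the `Machine` type, its fields, the helper names, and the error messages are all hypothetical, and the guard against shrinking below the number of registered nodes is an assumption about how the count is used.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Machine:
    """Hypothetical, simplified stand-in for a Cluster API machine."""
    name: str
    node_ref: Optional[str] = None  # None means the machine is still pending
    deleting: bool = False
    failed: bool = False


def count_registered_nodes(machines: List[Machine]) -> int:
    """Count machines backed by a node, skipping pending, deleting, and failed ones."""
    return sum(
        1
        for m in machines
        if m.node_ref is not None and not m.deleting and not m.failed
    )


def decrease_target_size(current_size: int, delta: int, machines: List[Machine]) -> int:
    """Return the new target size, refusing to drop below the registered node count."""
    if delta >= 0:
        raise ValueError("size decrease must be negative")
    new_size = current_size + delta
    # Assumption: only healthy, registered nodes block the decrease.
    if new_size < count_registered_nodes(machines):
        raise ValueError("attempt to delete existing nodes")
    return new_size
```

With one healthy node plus one pending, one failed, and one deleting machine, a target size of 4 can be lowered to 1 in this sketch, because only the healthy node counts against the decrease.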

This change makes it so that when a failed machine is found during `findScalableResourceProviderIDs`, it always gains a normalized provider ID with the failure guard prepended. This ensures that machines which gained a provider ID from the infrastructure and later went into a failed state can be properly removed by the autoscaler when it wants to correct the size of a node group.
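
One way to picture the behavior described above: the failure check runs before the provider ID is consulted, so a failed machine is tagged even if the infrastructure already assigned it an ID. The sketch below is illustrative Python only. `MachineInfo`, `FAILURE_GUARD`, the ID layout, and the function name are hypothetical and do not reproduce the real `findScalableResourceProviderIDs`.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical prefix; the real failure guard value is not shown in this PR.
FAILURE_GUARD = "failed-machine-"


@dataclass
class MachineInfo:
    """Hypothetical, simplified view of a machine."""
    namespace: str
    name: str
    provider_id: Optional[str] = None
    failed: bool = False


def scalable_resource_provider_ids(machines: List[MachineInfo]) -> List[str]:
    """Collect provider IDs, always tagging failed machines with the guard."""
    ids: List[str] = []
    for m in machines:
        if m.failed:
            # Checked first, so an infrastructure-assigned ID does not hide the failure.
            ids.append(f"{FAILURE_GUARD}{m.namespace}_{m.name}")
            continue
        if m.provider_id:
            ids.append(m.provider_id)
        # Machines with no provider ID and no failure are ignored in this sketch.
    return ids
```

Because the guarded ID is built from the machine's own namespace and name in this sketch, a caller can tell failed machines apart by prefix and select them for removal.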

omerap12 · 2025-04-01T06:59:43Z

/assign @maxcao13

Could you please double-check that I didn’t miss anything? I kept the docs high-level and avoided technical details, since they’re not relevant to the end user.
Appreciate you taking a look :)

adrianmoisey · 2025-04-01T12:41:49Z

Thanks for making this!
Does it make sense to put this in the in-place-updates branch, along with all the other In-place work?

maxcao13

Looks great, have a couple suggestions. Thanks for writing this!

vertical-pod-autoscaler/docs/features.md

maxcao13 · 2025-04-01T18:42:28Z

> Thanks for making this!
> Does it make sense to put this in the in-place-updates branch, along with all the other In-place work?

I think we can do that or we can just merge it afterwards, both seem reasonable to me.

Maybe we should merge this after the in-place-updates branch is merged in case there are any extremely final review changes that affect this doc?

omerap12 · 2025-04-02T06:31:56Z

> > Thanks for making this!
> > Does it make sense to put this in the in-place-updates branch, along with all the other In-place work?
>
> I think we can do that or we can just merge it afterwards, both seem reasonable to me.
>
> Maybe we should merge this after the in-place-updates branch is merged in case there are any extremely final review changes that affect this doc?

Agree.

raywainman · 2025-04-04T20:33:45Z

vertical-pod-autoscaler/docs/features.md

+* All containers in a pod are updated together (partial updates not supported)
+* Memory downscaling requires careful consideration to prevent OOMs
+* Updates still respect VPA's standard update conditions and timing restrictions
+* In-place updates will fail for pods with Guaranteed QoS class (requires pod recreation)


Is this true?

If the QoS class is guaranteed then requests == limits and VPA will just update both together (since the ratio between them is 1.0), which means the QoS class will never change.

My understanding is that the in-place feature will fail when you try to change the QoS class.

Am I missing something?

omerap12 · Apr 6, 2025

You are right, that was my misunderstanding, sorry.
Fixed in 125209d.
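
The reasoning in this thread can be checked with a small sketch. It is written in Python for illustration and simplified to a single container. `qos_class` follows the usual Kubernetes rule of thumb (Guaranteed when CPU and memory requests equal their limits, BestEffort when nothing is set, Burstable otherwise), and `apply_recommendation` is a hypothetical helper that keeps the limit-to-request ratio, as described in the comment above. Neither function is VPA code.

```python
from typing import Dict, Tuple

# Integer quantities, e.g. CPU in millicores and memory in MiB.
Resources = Dict[str, int]


def qos_class(requests: Resources, limits: Resources) -> str:
    """Simplified single-container QoS classification."""
    if not requests and not limits:
        return "BestEffort"
    if all(
        name in requests and name in limits and requests[name] == limits[name]
        for name in ("cpu", "memory")
    ):
        return "Guaranteed"
    return "Burstable"


def apply_recommendation(
    requests: Resources, limits: Resources, recommended: Resources
) -> Tuple[Resources, Resources]:
    """Set new requests and scale each limit by its old limit/request ratio.

    Assumes every resource that has a limit also has a request.
    """
    new_limits = {
        name: recommended[name] * limit // requests[name]
        for name, limit in limits.items()
    }
    return dict(recommended), new_limits
```

For a pod whose requests equal its limits the ratio is 1.0, so the new requests and limits stay equal and the class remains Guaranteed. That matches the point that an in-place update of such a pod does not change its QoS class.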

omerap12 · 2025-05-04T06:33:58Z

Gonna push this to in-place
/close

k8s-ci-robot · 2025-05-04T06:34:04Z

@omerap12: Closed this PR.

rakechill and others added 30 commits February 11, 2025 14:31

Update skewer version on master branch

b66b446

Undo previous changes made by go mod vendor

72665b3

This reverts commit b66b446.

Update only skewer with go get dep@ver

ed62128

Support CPU Startup Boost in VPA

c76a771

Add AEP ID

4b98746

See kubernetes#7862

Fixes histograms becoming empty after loaded from checkpoints

2aba671

Convert ClusterState to interface

18ed036

Store InitContainers in PodState

481f8db

Drop MetricSamples for InitContainers

3c7689f

Address review feedback

33871fa

Address comments and wrap lines

bd363cd

Add adrianmoisey to VPA approvers

0f5fe42

Merge pull request kubernetes#7935 from adrianmoisey/add-adrian-as-approver

4400ed9

Add adrianmoisey to VPA approvers

change our release process to bump the version in the main branch immediately after cutting a release branch so that new development is done against the new version

29d9088

Merge pull request kubernetes#7939 from raywainman/vpa-version-bump

f04fd5b

Bump VPA version in main branch and change release process

Fix typo

9043687

Address comments + add a feature enablement/rollback section

e347836

Merge pull request kubernetes#7929 from elmiko/issue-7928-decrease-size-fix

9937f8f

make DecreaseTargetSize more accurate for clusterapi

Allow using scheduled pods as samples in proactive scale up

9a5e3d9

Fix log for node filtering in static autoscaler

105429c

Add missing tests

Merge pull request kubernetes#7950 from elmiko/improve-failed-machine-detection

1f65569

improve failed machine detection in clusterapi

capi: node and provider ID accounting funcs

4aa4657

Signed-off-by: Jack Francis <[email protected]>

s/nodeHasValidProviderID/isProviderIDNormalized

7b5e101

Signed-off-by: Jack Francis <[email protected]>

Merge pull request kubernetes#7952 from jackfrancis/capi-providerID-nodeHasValidProviderID

dc57f7c

capi: node and provider ID accounting funcs

Update default value for scaleDownDelayAfterDelete (kubernetes#7957)

5268053

* Update default value for scaleDownDelayAfterDelete: set the default value of scaleDownDelayAfterDelete to scanInterval instead of 0.
* Revert the change and fix the flag description

Merge pull request kubernetes#7944 from norbertcyran/proactive-scale-up-sample-scheduled

10bb546

Allow using scheduled pods as samples in proactive scale up

Address more comments in AEP

455d290

Merge pull request kubernetes#7949 from ystryuchkov/master

990ab04

Fix log for node filtering in static autoscaler

k8s-ci-robot added the cncf-cla: yes and size/M labels Apr 1, 2025

k8s-ci-robot assigned maxcao13 Apr 1, 2025

Add docs for in-place updates

63ed537

Signed-off-by: Omer Aplatony <[email protected]>

omerap12 force-pushed the in-place-docs branch from 84e6e4c to 63ed537 April 1, 2025 07:19

maxcao13 reviewed Apr 1, 2025

vertical-pod-autoscaler/docs/features.md

vertical-pod-autoscaler/docs/features.md (outdated)

vertical-pod-autoscaler/docs/features.md (outdated)

omerap12 added 2 commits April 2, 2025 06:47

Add AEP link and refine intro for VPA in-place updates documentation

50a1035

Signed-off-by: Omer Aplatony <[email protected]>

add feature state

a34a352

Signed-off-by: Omer Aplatony <[email protected]>

omerap12 requested a review from maxcao13 April 2, 2025 06:51

adrianmoisey reviewed Apr 2, 2025

vertical-pod-autoscaler/docs/features.md (outdated)

adrianmoisey reviewed Apr 2, 2025

vertical-pod-autoscaler/docs/features.md (outdated)

adrianmoisey reviewed Apr 2, 2025

vertical-pod-autoscaler/docs/features.md (outdated)

omerap12 and others added 3 commits April 2, 2025 20:38

Update vertical-pod-autoscaler/docs/features.md

e3fbeff

Co-authored-by: Adrian Moisey <[email protected]>

Update vertical-pod-autoscaler/docs/features.md

3f2845c

Co-authored-by: Adrian Moisey <[email protected]>

Update vertical-pod-autoscaler/docs/features.md

5b5ae39

Co-authored-by: Adrian Moisey <[email protected]>

raywainman reviewed Apr 4, 2025

fixed wrong statement

125209d

Signed-off-by: Omer Aplatony <[email protected]>

omerap12 changed the base branch from master to in-place-updates May 4, 2025 06:32

k8s-ci-robot added the size/XXL label and removed the size/M label May 4, 2025

k8s-ci-robot closed this May 4, 2025

omerap12 deleted the in-place-docs branch May 4, 2025 06:34

omerap12 restored the in-place-docs branch May 4, 2025 06:35

omerap12 deleted the in-place-docs branch May 4, 2025 06:42

omerap12 Apr 6, 2025

18 participants