KEP-3322: add a new field maxRestartTimesOnFailure to podSpec #3339
base: master
Conversation
cc @wojtek-t PTAL, thanks a lot.
We're generally doing PRR once you already have SIG approval.
cc @dchen1107 for sig-node side review, also cc @hex108
This KEP is especially helpful for pods that hold a large resource set, such as JVM-based pods. We give these kinds of pods a high resource limit to speed up their startup; an Always restart policy makes this worse and can even crash the node. In the past, daemon control tools like supervisorctl had a startretries mechanism to limit the maximum number of startup retries, but for Kubernetes deployments there is no replacement for it.
It seems there is no SIG-level agreement on it - I made a quick pass and added some comments, but please ping me only once you have SIG-level approval.
Pros:
* BackoffLimitPerIndex can reuse this functionality and no longer needs to consider the restart times per index.
Specifically, it can avoid using the annotation anymore, and works as a higher-level control by watching the pod status.
I actually agree with Aldo.
It's not an implementation detail - it's a fundamental thing of "pod recreations". If we want to track something across pod recreations (which is the case for jobs), maxRestarts won't solve it on its own - but actually may help with it.
Add a new field `maxRestartTimesOnFailure` to the `pod.spec`. `maxRestartTimesOnFailure` only applies
+1 to Tim - I think that generalizing it to "Always" is natural and instead of making the API narrow, let's make it more generic.
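For concreteness, here is a rough sketch of what the generalized field could look like on `PodSpec`. The field name comes from this KEP, but the Go doc comments and the extension to `Always` are assumptions based on the discussion above, not merged API:

```go
// Sketch only: not the actual core/v1 types, just an illustration of the
// proposed field. Applying the cap to the Always policy reflects the review
// discussion above and is an assumption, not decided API.
package v1sketch

// RestartPolicy mirrors the core/v1 enum for illustration.
type RestartPolicy string

const (
	RestartPolicyAlways    RestartPolicy = "Always"
	RestartPolicyOnFailure RestartPolicy = "OnFailure"
	RestartPolicyNever     RestartPolicy = "Never"
)

// PodSpec shows only the fields relevant to this KEP; everything else is elided.
type PodSpec struct {
	// RestartPolicy for all containers within the pod.
	RestartPolicy RestartPolicy `json:"restartPolicy,omitempty"`

	// MaxRestartTimesOnFailure caps how many times the kubelet restarts failed
	// containers in this pod. nil preserves today's unlimited behavior.
	// +optional
	MaxRestartTimesOnFailure *int32 `json:"maxRestartTimesOnFailure,omitempty"`
}
```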
Because the kubelet will not upgrade/downgrade until the api-server is ready, this will not impact
I don't understand it.
FWIW - this section isn't necessary for Alpha, so given time bounds you may want to delete your answers from the rollout and monitoring sections, as their answers are controversial...
What I mean here is that when upgrading api-servers, we'll wait until all the apiservers are ready, then upgrade the kubelet. So if feature gates are enabled only on some apiservers, we'll do nothing. Is this reasonable? Or is what we want here all the possibilities rather than the best practices, since it says "as paranoid as possible"? cc @wojtek-t
Removed for alpha
I would just comment out this section.
reviewers:
- TBD
approvers:
- TBD
We need to find an approver. Without an approver defined we are unlikely to be able to take it.
@mrunalp @derekwaynecarr @dchen1107 any of you want to take it?
- If Pod's `RestartPolicy` is `Always` or `Never`, `maxRestartTimesOnFailure` defaults to nil, and will not apply.
- If Pod's `RestartPolicy` is `OnFailure`, `maxRestartTimesOnFailure` also defaults to nil, which means infinite restart times for backwards compatibility.
Two questions:
- if Pod's `RestartPolicy` is `OnFailure` and `maxRestartTimesOnFailure` is 0, is it invalid? or does it mean `Never`?
- is the `maxRestartTimesOnFailure` editable for the pod?
IMO:
- if `restartPolicy` is "OnFailure" and `maxRestarts` is 0, it is effectively "Never". I don't think we need to special-case 0 to be a failure, but I don't feel strongly and it could be argued either way.
- let's start with "no" and see if there's really a need?
Yeah, it feels like never to me too.
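To make the semantics being converged on here concrete (nil = unlimited, 0 effectively `Never`), here is a minimal sketch. The helper name is hypothetical, and applying the cap to `Always` follows the earlier suggestion rather than the current KEP text:

```go
package restartsketch

// allowRestart is a hypothetical helper illustrating the discussed semantics:
// nil keeps today's unlimited restarts, and 0 behaves like restartPolicy=Never
// rather than being rejected by validation.
func allowRestart(restartPolicy string, maxRestartTimesOnFailure *int32, restartsSoFar int32) bool {
	switch restartPolicy {
	case "Never":
		return false
	case "Always", "OnFailure":
		if maxRestartTimesOnFailure == nil {
			// nil: unlimited restarts, backwards compatible.
			return true
		}
		// A value of 0 never allows a restart, i.e. effectively "Never".
		return restartsSoFar < *maxRestartTimesOnFailure
	default:
		return false
	}
}
```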
Deadline is in ~8 hours -- is this still hoping to land?
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: drinktee, kerthcet. The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files. Approvers can indicate their approval by writing `/approve` in a comment.
Otherwise we could end up in another scenario where features work independently, but when paired together are mostly useless or have very strange semantics that are hard for users to understand, and hard to maintain.
I like this analysis.
Pros:
* Reduce the maintenance cost of Job API

Cons:
But it's not the same pod. Job is a higher-level construct.
Yes, it feels like a conflation of a pod-level `maxRestarts` and a job-level `maxRecreates` or something.
In runtime, we'll check the sum of `RestartCount` of all containers [`Status`](https://github.com/kubernetes/kubernetes/blob/451e1fa8bcff38b87504eebd414948e505526d01/pkg/kubelet/container/runtime.go#L306-L335)
`Pod.spec.containers[].maxRestarts` reads well to me.
But it's a little weird because Containers also have a `RestartPolicy` and the only allowed value is `Always`. It would stop being intuitive how the attribute interactions actually work, because now we're intermingling pod-level attributes with container-level attributes.
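To make the "sum of `RestartCount`" check from the quoted text concrete, here is a minimal sketch over the public `PodStatus` type (the kubelet-internal runtime `Status` linked above carries an analogous field); the helper names are made up:

```go
package restartsketch

import corev1 "k8s.io/api/core/v1"

// totalRestarts sums RestartCount across the regular containers in a pod's
// status, mirroring at the API level the runtime-side check described in the
// quoted text. Init and ephemeral containers are ignored for simplicity.
func totalRestarts(status corev1.PodStatus) int32 {
	var sum int32
	for _, cs := range status.ContainerStatuses {
		sum += cs.RestartCount
	}
	return sum
}

// exceeded reports whether the summed restarts have reached the configured
// maxRestartTimesOnFailure (nil means no cap).
func exceeded(status corev1.PodStatus, maxRestartTimesOnFailure *int32) bool {
	return maxRestartTimesOnFailure != nil && totalRestarts(status) >= *maxRestartTimesOnFailure
}
```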
The kubelet version should be consistent with the api-server version, or this feature
Does this enhancement involve coordinating behavior in the control plane and in the kubelet? How does an n-2 kubelet without this feature available behave when this feature is used?
In other words, if we have a kubelet at 1.26 and a kube-apiserver at 1.29 with the feature enabled, what is the expected behavior?
Will any other components on the node change? For example, changes to CSI, CRI or CNI may require updating that component before the kubelet.
I believe the answer to this should be no.
When we set restartPolicy=OnFailure and set a specific maxRestartTimesOnFailure number, but the Pod's restart count is not equal to that number.
Or we can refer to the metric `pod_exceed_restart_times_size` for comparison.
Is this a new or existing metric?
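(It would be a new metric.) As a sketch of how it might be registered in the kubelet, using the component-base metrics helpers the kubelet already uses; the metric type, help text, and subsystem are assumptions since the KEP only names the metric:

```go
package metricsketch

import (
	"k8s.io/component-base/metrics"
	"k8s.io/component-base/metrics/legacyregistry"
)

// podExceedRestartTimes is a sketch of the proposed metric. A counter is
// assumed here (incremented each time a pod hits its restart cap); the KEP
// does not spell out the type or help text, so these are illustrative.
var podExceedRestartTimes = metrics.NewCounter(
	&metrics.CounterOpts{
		Subsystem:      "kubelet",
		Name:           "pod_exceed_restart_times_size",
		Help:           "Number of pods whose containers reached maxRestartTimesOnFailure.",
		StabilityLevel: metrics.ALPHA,
	},
)

func init() {
	legacyregistry.MustRegister(podExceedRestartTimes)
}
```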
The Kubernetes project currently lacks enough contributors to adequately respond to all PRs. Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
/remove-lifecycle stale
The Kubernetes project currently lacks enough contributors to adequately respond to all PRs. Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
hey, it's 2024, any update? /lifecycle frozen
@adampl: Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
The Kubernetes project currently lacks enough active contributors to adequately respond to all PRs. Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle rotten
/remove-lifecycle rotten
@kerthcet Are you going to work on this?
FYI on a somewhat related KEP #4603
@alculquicondor I'm trying to see if this is a KEP that sig-node should help review for 1.32.
@lauralorenz presented the plan for CrashLoopBackOff. I think that any kind of capping of max restart times is out of scope for #4603.
Not yet; if you're interested, you can take it over. I have other work keeping me busy right now.
The Kubernetes project currently lacks enough contributors to adequately respond to all PRs. Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
/remove-lifecycle stale
I'm not convinced that this should be a kubelet & pod-level concern, but whether or not we eventually want it on the pod, it seems like a good candidate for prototyping out-of-tree. The high-level idea would be a controller that watches containers for restarts and deletes the pod when a policy deems it necessary (is deletion sufficient to mark a job as failed?). The API could use annotations and/or a CRD.
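For concreteness, here is a minimal sketch of such an out-of-tree prototype. The annotation name, the naive polling loop (instead of an informer), and the delete-on-exceed policy are all illustrative assumptions, not part of any proposal:

```go
// Sketch of the out-of-tree prototype suggested above: poll pods that carry a
// (hypothetical) annotation, sum container restarts, and delete the pod once
// the annotated limit is reached. A real prototype would use informers and a
// proper policy API (annotations or a CRD), as noted in the comment.
package main

import (
	"context"
	"strconv"
	"time"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
	"k8s.io/klog/v2"
)

// Hypothetical annotation used only for this sketch.
const maxRestartsAnnotation = "example.com/max-restart-times-on-failure"

func totalRestarts(status corev1.PodStatus) int32 {
	var sum int32
	for _, cs := range status.ContainerStatuses {
		sum += cs.RestartCount
	}
	return sum
}

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		klog.Fatal(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)

	for {
		pods, err := client.CoreV1().Pods(metav1.NamespaceAll).List(context.TODO(), metav1.ListOptions{})
		if err != nil {
			klog.Error(err)
			time.Sleep(30 * time.Second)
			continue
		}
		for _, pod := range pods.Items {
			limitStr, ok := pod.Annotations[maxRestartsAnnotation]
			if !ok {
				continue
			}
			limit, err := strconv.Atoi(limitStr)
			if err != nil || limit < 0 {
				continue
			}
			if totalRestarts(pod.Status) >= int32(limit) {
				klog.Infof("deleting pod %s/%s: restart limit %d reached", pod.Namespace, pod.Name, limit)
				if err := client.CoreV1().Pods(pod.Namespace).Delete(context.TODO(), pod.Name, metav1.DeleteOptions{}); err != nil {
					klog.Error(err)
				}
			}
		}
		time.Sleep(30 * time.Second)
	}
}
```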
Signed-off-by: kerthcet [email protected]