Argo Rollouts only supports full promote for canary on 100% available pods in the new RS #4365

@dlmather

Description

Summary

Per the check at https://github.com/argoproj/argo-rollouts/blob/master/rollout/sync.go#L947, a canary rollout can only be fully promoted when ALL pods in the new ReplicaSet are available. However, we run systems like Karpenter, which keep performing consolidation and similar disruptions up to our PDB maxUnavailable limit, and this can seriously delay the Rollout from completing. It would be fantastic if Argo Rollouts instead checked against the PDB value, tolerated a certain amount of ongoing churn, or otherwise allowed the full promote to go through even when fewer pods are available than the rollout spec declares.
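To make the request concrete, here is a minimal sketch of the two behaviors. It is not the actual Argo Rollouts code: `strictReady` and `tolerantReady` are hypothetical names, and `maxUnavailable` is assumed to be already resolved to an absolute pod count (a PDB may also express it as a percentage).

```go
package main

import "fmt"

// strictReady models the behavior described in this issue: full promotion
// is only allowed once every desired pod in the new ReplicaSet is available.
// Hypothetical helper, not taken from the Argo Rollouts source.
func strictReady(desired, available int32) bool {
	return available >= desired
}

// tolerantReady models the requested behavior: full promotion is allowed
// while at most maxUnavailable pods are missing. maxUnavailable is assumed
// to be an absolute count derived from the PDB (hypothetical parameter).
func tolerantReady(desired, available, maxUnavailable int32) bool {
	// Clamp the tolerance to [0, desired] so the threshold never goes negative.
	if maxUnavailable < 0 {
		maxUnavailable = 0
	}
	if maxUnavailable > desired {
		maxUnavailable = desired
	}
	return available >= desired-maxUnavailable
}

func main() {
	// 10 desired pods, 9 available because one is being consolidated.
	fmt.Println("strict:", strictReady(10, 9))
	fmt.Println("tolerant (maxUnavailable=1):", tolerantReady(10, 9, 1))
	fmt.Println("tolerant (maxUnavailable=1, 8 available):", tolerantReady(10, 8, 1))
}
```

With a tolerance of zero the tolerant check reduces to the strict one, so a change along these lines could presumably be made opt-in without altering today's default.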

Use Cases

We would use this across all of our rollouts, since we run a lot of consolidation activity that kicks in even while gradual rollouts are in progress. Our current workaround is to disable consolidation on pods that are participating in the active deploy, but this can hurt our efficiency.


Message from the maintainers:

Impacted by this bug? Give it a 👍. We prioritize the issues with the most 👍.

Metadata

Assignees

No one assigned

    Labels

    enhancement: New feature or request
