feat: Added UnavailableOfferingsTTL exposed as the flag & env variable in the karpenter #8013

ankitjain28may · 2025-04-24T10:16:01Z

Fixes #N/A

Description
Exposed UnavailableOfferingsTTL as both a flag and an environment variable in Karpenter. The default value of 3 minutes was too high for our environment, which operates with highly volatile workloads.

How was this change tested?

Does this change impact docs?

Yes, PR includes docs updates
Yes, issue opened: #
No

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

netlify · 2025-04-24T10:16:16Z

✅ Deploy Preview for karpenter-docs-prod ready!

Name	Link
🔨 Latest commit	`e5eff64`
🔍 Latest deploy log	https://app.netlify.com/sites/karpenter-docs-prod/deploys/680a10c6f1cb330008fb93fa
😎 Deploy Preview	https://deploy-preview-8013--karpenter-docs-prod.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

…he karpenter

DerekFrank

Leaving some preliminary style+structure feedback, will need to discuss with the provider working group to determine if there are any unintended consequences\product reasons to push back on accepting this PR. Would love if you could attend and share your use case directly!

https://karpenter.sh/docs/contributing/community-meetings/

DerekFrank · 2025-04-28T20:38:26Z

pkg/operator/options/options.go

@@ -53,6 +54,7 @@ func (o *Options) AddFlags(fs *coreoptions.FlagSet) {
 	fs.Float64Var(&o.VMMemoryOverheadPercent, "vm-memory-overhead-percent", utils.WithDefaultFloat64("VM_MEMORY_OVERHEAD_PERCENT", 0.075), "The VM memory overhead as a percent that will be subtracted from the total memory for all instance types when cached information is unavailable.")
 	fs.StringVar(&o.InterruptionQueue, "interruption-queue", env.WithDefaultString("INTERRUPTION_QUEUE", ""), "Interruption queue is the name of the SQS queue used for processing interruption events from EC2. Interruption handling is disabled if not specified. Enabling interruption handling may require additional permissions on the controller service account. Additional permissions are outlined in the docs.")
 	fs.IntVar(&o.ReservedENIs, "reserved-enis", env.WithDefaultInt("RESERVED_ENIS", 0), "Reserved ENIs are not included in the calculations for max-pods or kube-reserved. This is most often used in the VPC CNI custom networking setup https://docs.aws.amazon.com/eks/latest/userguide/cni-custom-network.html.")
+	fs.IntVar(&o.UnavailableOfferingsTTL, "unavailable-offerings-ttl", env.WithDefaultInt("UNAVAILABLE_OFFERINGS_TTL", 180), "The Unavailable offerings TTL is the time before offerings that were marked as unavailable are removed from the cache and are available for launch again.")


Is there a particular reason to switch this TTL to seconds over minutes? Either way, the description should include a '... in minutes ...' or an '... in seconds ...' to indicate to the user the correct unit.

DerekFrank · 2025-04-28T20:43:39Z

pkg/cache/unavailableofferings.go

@@ -37,7 +39,8 @@ type UnavailableOfferings struct {
 	SeqNum            uint64
 }

-func NewUnavailableOfferings() *UnavailableOfferings {
+func NewUnavailableOfferings(ctx context.Context) *UnavailableOfferings {
+	UnavailableOfferingsTTL := time.Duration(int64(options.FromContext(ctx).UnavailableOfferingsTTL)) * time.Second


In general I'd prefer to pass in only the TTL as an argument than the whole ctx object just to keep the function footprint small. Additionally, why store the option as an int just to cast it to an int64 just to convert it to a time.Duration?

jmdeal · 2025-04-28T23:49:32Z

The default value of 3 minutes was too high for our environment, which operates with highly volatile workloads

Are you able to elaborate on this? Your workloads shouldn't have any impact on what this value should be since they don't affect how long it will take for a spot pool to become available again. Generally speaking, I'm not convinced there is a use-case for tuning this value, but would like to know more about your use-case.

ankitjain28may requested a review from a team as a code owner April 24, 2025 10:16

ankitjain28may requested a review from jigisha620 April 24, 2025 10:16

Added UnavailableOfferingsTTL exposed as the flag & env variable in t…

e5eff64

…he karpenter

ankitjain28may force-pushed the f/unavailableOfferingsTTL branch from 911cce0 to e5eff64 Compare April 24, 2025 10:21

DerekFrank reviewed Apr 28, 2025

View reviewed changes

jmdeal added the triage/needs-information Marks that the issue still needs more information to properly triage label May 1, 2025

jonathan-innis added the lifecycle/stale label May 26, 2025

engedaam assigned DerekFrank Jun 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Added UnavailableOfferingsTTL exposed as the flag & env variable in the karpenter #8013

feat: Added UnavailableOfferingsTTL exposed as the flag & env variable in the karpenter #8013

Uh oh!

ankitjain28may commented Apr 24, 2025

Uh oh!

netlify bot commented Apr 24, 2025 •

edited

Loading

Uh oh!

DerekFrank left a comment

Uh oh!

DerekFrank Apr 28, 2025

Uh oh!

DerekFrank Apr 28, 2025

Uh oh!

jmdeal commented Apr 28, 2025

Uh oh!

Uh oh!

feat: Added UnavailableOfferingsTTL exposed as the flag & env variable in the karpenter #8013

Are you sure you want to change the base?

feat: Added UnavailableOfferingsTTL exposed as the flag & env variable in the karpenter #8013

Uh oh!

Conversation

ankitjain28may commented Apr 24, 2025

Uh oh!

netlify bot commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for karpenter-docs-prod ready!

Uh oh!

DerekFrank left a comment

Choose a reason for hiding this comment

Uh oh!

DerekFrank Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

DerekFrank Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

jmdeal commented Apr 28, 2025

Uh oh!

Uh oh!

netlify bot commented Apr 24, 2025 •

edited

Loading