Open
Description
Description
Observed Behavior: We noticed that a node in EC2 is not reachable and failing health checks but karpenter is not terminating the node
Expected Behavior: Karpenter should terminate the node if it is not reachable
Reproduction Steps (Please include YAML): Not sure how since the node failed health checks
- currently the node has the following 2 taints
node.kubernetes.io/unreachable:NoSchedule
node.kubernetes.io/unreachable:NoExecute
- I can also confirm that in EC2 console I see that the instance failed health checks like 6 hours ago
- most of the pods on the node are stuck in
Terminating
state - there are no logs related to this node/nodeClaim in karpenter even after enabling debug logs
Versions:
- Chart Version: v0.34.0
- Kubernetes Version (
kubectl version
): 1.29.8
Activity