Skip to content

Control plane load balancer sends requests to unhealthy machines #516

Open
@dlipovetsky

Description

@dlipovetsky

Describe the bug

The control plane load balancer sends requests to unhealthy machines. I have seen this cause kubeadm join to fail on machines during bootstrapping.

The cause is that CAPVCD creates a load balancer whose "Graceful Disable Timeout" is set to the default, 1 minute. The timeout should be disabled. However, vmware/cloud-provider-for-cloud-director#292 does not allow the timeout to be disabled.

Reproduction steps

  1. Create a CAPVCD cluster.
  2. Confirm that the timeout is 1 minute in the control plane load balancer configuration:
    image
  3. Remove the kube-apiserver static pod from one of the control plane nodes.
  4. Send Kubernetes API requests to the load balancer. Some request should fail.

Expected behavior

The timeout should be disabled, and the load balancer should not send requests to an unhealthy machine.

Additional context

No response

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions