Skip to content

[bug] Cluster scale down should not hang on destroying a disconnected machine #1044

Open
@steverfrancis

Description

@steverfrancis

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

One a 3 node cluster (all CP nodes), if one of the nodes is in "Disconnected" state (because it is powered off), and I attempt to remove it from the cluster, Omni will get stuck in "Scaling down", with that machine showing "Destroying".
And just sit there. (Maybe there is a timeout, but it has been more than 5 minutes so far.)

Expected Behavior

Given that Omni knows that machine is in "DIsconnected" state, there is no reason to wait for its state to reconcile, or whatever it is doing.
The only way to get Omni to proceed was to remove the machine from the Machines tab.

This should be an instantaneous action when scaling down the cluster, with a warning "Machine X is currently disconnected. Removing it from the cluster will require the machine to be unregistered from Omni. The machine will need to wiped and reinstalled to be used again with Omni. Proceed?"

Steps To Reproduce

Create a 3 node cluster.
Power off one node, wait for it to be "Disconnected"
Attempt to scale down the cluster by removing the disconnected node.

What browsers are you seeing the problem on?

No response

Anything else?

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions