You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Take note of the ID and the name of the unhealthy etcd member, because these values are needed later in the procedure. The `$ etcdctl endpoint health` command will list the removed member until the procedure of replacement is finished and a new member is added.
83
+
Take note of the ID and the name of the unhealthy etcd member because these values are needed later in the procedure. The `$ etcdctl endpoint health` command will list the removed member until the procedure of replacement is finished and a new member is added.
84
84
85
85
.. Remove the unhealthy etcd member by providing the ID to the `etcdctl member remove` command:
. Delete and re-create the control plane machine. After this machine is re-created, a new revision is forced and etcd scales up automatically.
193
+
. Check whether a control plane machine set exists by entering the following command:
194
194
+
195
-
If you are running installer-provisioned infrastructure, or you used the Machine API to create your machines, follow these steps. Otherwise, you must create the new master by using the same method that was used to originally create it.
195
+
[source,terminal]
196
+
----
197
+
$ oc -n openshift-machine-api get controlplanemachineset
198
+
----
199
+
200
+
* If the control plane machine set exists, delete and re-create the control plane machine. After this machine is re-created, a new revision is forced and etcd scales up automatically. For more information, see "Replacing an unhealthy etcd member whose machine is not running or whose node is not ready".
201
+
+
202
+
If you are running installer-provisioned infrastructure, or you used the Machine API to create your machines, follow these steps. Otherwise, you must create the new control plane by using the same method that was used to originally create it.
<1> The new machine, `clustername-8qw5l-master-3` is being created and is ready once the phase changes from `Provisioning` to `Running`.
248
255
+
249
-
It might take a few minutes for the new machine to be created. The etcd cluster Operator will automatically sync when the machine or node returns to a healthy state.
256
+
It might take a few minutes for the new machine to be created. The etcd cluster Operator automatically syncs when the machine or node returns to a healthy state.
250
257
+
251
258
[NOTE]
252
259
====
253
260
Verify the subnet IDs that you are using for your machine sets to ensure that they end up in the correct availability zone.
254
261
====
255
262
263
+
* If the control plane machine set does not exist, delete and re-create the control plane machine. After this machine is re-created, a new revision is forced and etcd scales up automatically.
264
+
+
265
+
If you are running installer-provisioned infrastructure, or you used the Machine API to create your machines, follow these steps. Otherwise, you must create the new control plane by using the same method that was used to originally create it.
266
+
267
+
.. Obtain the machine for the unhealthy member.
268
+
+
269
+
In a terminal that has access to the cluster as a `cluster-admin` user, run the following command:
270
+
+
271
+
[source,terminal]
272
+
----
273
+
$ oc get machines -n openshift-machine-api -o wide
274
+
----
275
+
+
276
+
.Example output
277
+
[source,terminal]
278
+
----
279
+
NAME PHASE TYPE REGION ZONE AGE NODE PROVIDERID STATE
... Change the `metadata.name` field to a new name.
335
+
+
336
+
Keep the same base name as the old machine and change the ending number to the next available number. In this example, `clustername-8qw5l-master-0` is changed to `clustername-8qw5l-master-3`.
<1> The new machine, `clustername-8qw5l-master-3` is being created and is ready once the phase changes from `Provisioning` to `Running`.
409
+
+
410
+
It might take a few minutes for the new machine to be created. The etcd cluster Operator automatically syncs when the machine or node returns to a healthy state.
411
+
256
412
. Turn the quorum guard back on by entering the following command:
257
413
+
258
414
[source,terminal]
@@ -337,4 +493,4 @@ If the output from the previous command lists more than three etcd members, you
337
493
[WARNING]
338
494
====
339
495
Be sure to remove the correct etcd member; removing a good etcd member might lead to quorum loss.
0 commit comments