Skip to content

Commit 5112fd9

Browse files
committed
TELCODOCS#2230: Coordinating reboots for configuration changes
1 parent 49eb7d3 commit 5112fd9

File tree

3 files changed

+47
-1
lines changed

3 files changed

+47
-1
lines changed

Diff for: edge_computing/policygenerator_for_ztp/ztp-configuring-managed-clusters-policygenerator.adoc

+2
Original file line numberDiff line numberDiff line change
@@ -55,6 +55,8 @@ include::modules/ztp-customizing-a-managed-site-using-pgt.adoc[leveloffset=+1]
5555
5656
include::modules/ztp-monitoring-policy-deployment-progress.adoc[leveloffset=+1]
5757

58+
include::modules/ztp-coordinating-reboots-for-config-changes.adoc[leveloffset=+1]
59+
5860
include::modules/ztp-validating-the-generation-of-configuration-policy-crs.adoc[leveloffset=+1]
5961

6062
include::modules/ztp-restarting-policies-reconciliation.adoc[leveloffset=+1]

Diff for: modules/defer-applicaton-tuning-example.adoc

+5-1
Original file line numberDiff line numberDiff line change
@@ -55,4 +55,8 @@ spec:
5555

5656
<1> The `include` directive is used to inherit the `openshift-node-performance-performance` profile. This is a best practice to ensure that the profile is not missing any required settings.
5757
<2> The `kernel.shmmni` sysctl parameter is being changed to `8192`.
58-
<3> The `machineConfigLabels` field is used to target the `worker-cnf` role. Configure a `MachineConfigPool` resource to ensure the profile is applied only to the correct nodes.
58+
<3> The `machineConfigLabels` field is used to target the `worker-cnf` role. Configure a `MachineConfigPool` resource to ensure the profile is applied only to the correct nodes.
59+
60+
You can use {cgu-operator-full} to coordinate reboots across a fleet of spoke clusters when configuration changes require a reboot. These configuration changes include updates to tuning profiles that modify kernel parameters or system behavior.{cgu-operator} ensures that only nodes with a degraded tuned profile, indicating a reboot is needed, are rebooted. Instead of rebooting nodes after each individual change, you can apply all configuration updates through policies and then trigger a single, coordinated reboot.
61+
62+
For more information about coordinated reboots, see xref:../../edge_computing/policygenerator_for_ztp/ztp-configuring-managed-clusters-policygenerator.adoc#ztp-coordinating-reboots-for-config-changes_ztp-configuring-managed-clusters-policygenerator[Coordinating reboots for configuration changes]
+40
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
// Module included in the following assemblies:
2+
//
3+
// * scalability_and_performance/ztp_far_edge/ztp-configuring-managed-clusters-policies.adoc
4+
5+
:_mod-docs-content-type: PROCEDURE
6+
[id="ztp-coordinating-reboots-for-config-changes_{context}"]
7+
= Coordinating reboots for configuration changes
8+
9+
You can use {cgu-operator-full} to coordinate reboots across a fleet of spoke clusters when configuration changes require a reboot.
10+
11+
The following types of configuration changes typically require a reboot:
12+
13+
* Updates to tuning profiles that modify kernel parameters or system behavior.
14+
* Kubelet configuration updates.
15+
* Node-level changes delivered through `MachineConfig`, such as `sysctl` settings or system service changes.
16+
17+
{cgu-operator} ensures that only nodes with a degraded tuned profile, indicating a reboot is needed, are rebooted.
18+
19+
Instead of rebooting nodes after each individual change, you can apply all configuration updates through policies and then trigger a single, coordinated reboot.
20+
21+
.Prerequisites
22+
23+
* You have installed the {oc-first}.
24+
* You have logged in to the hub cluster as a user with `cluster-admin` privileges.
25+
* You have deployed and configured {cgu-operator}.
26+
27+
.Procedure
28+
29+
. Create the configuration policies and policy bindings for the tuning or configuration changes.
30+
31+
. Create a reboot policy on the hub cluster. This policy checks for degraded tuned profiles that indicate a reboot is needed.
32+
33+
. Create and apply the `ClusterGroupUpgrade` (CGU) custom resource (CR). In the `spec.managedPolicies` list, include all relevant configuration policies first, followed by the reboot policy and the optional `mcp-validator` policy.
34+
+
35+
After you apply the CGU custom resource, {cgu-operator} rolls out the configuration policies in order. When all policies are compliant, {cgu-operator} applies the reboot policy. This triggers a reboot of all nodes in the specified `MachineConfigPool`.
36+
37+
.Verification
38+
39+
* Confirm that all nodes in the `MachineConfigPool` have rebooted.
40+
* Verify that the `MachineConfigPool` reaches the `Updated` state.

0 commit comments

Comments
 (0)