Skip to content

Commit a3f2da2

Browse files
add service team runbook
1 parent 6a92503 commit a3f2da2

1 file changed

Lines changed: 139 additions & 0 deletions

File tree

Lines changed: 139 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,139 @@
1+
---
2+
title: Stage 3 Migration – Service Team Runbook
3+
last_reviewed_on: 2026-03-09
4+
review_in: 6 months
5+
---
6+
7+
# <%= current_page.data.title %>
8+
9+
## Purpose
10+
11+
This runbook outlines the actions expected from service teams during **Stage 3 migration of AWS accounts to the hardened Workloads organisational unit (OU)** within the Cabinet Office AWS Control Tower environment.
12+
13+
Stage 3 introduces additional **Service Control Policy (SCP)** enforcement aligned with organisational platform guardrails. These guardrails strengthen security, governance, and operational consistency across service accounts.
14+
15+
The objective of this migration stage is to move service workloads into a governed organisational unit while minimising disruption to running services.
16+
17+
A summary of the SCP guardrails applied within the Workloads OU is available in the Digital Handbook:
18+
19+
[Workloads SCP Guardrails – Digital Handbook](https://digital-handbook.cabinetoffice.gov.uk/docs/cloud/aws-org-units-in-control-tower.html#workloads-scps)
20+
21+
## Scope
22+
23+
This runbook applies to all **service teams whose AWS accounts are being migrated from the Transition OU to the Workloads OU** as part of the AWS organisation migration programme.
24+
25+
It applies to:
26+
27+
* Development
28+
* Non-production
29+
* Staging
30+
* Production
31+
32+
accounts being migrated during Stage 3.
33+
34+
For most services, no architectural changes are required. Where potential issues are identified during the **pre-migration AWS Config assessment**, these will be reviewed collaboratively with service teams before migration.
35+
36+
## Migration Process
37+
38+
### 1. Before Migration
39+
40+
Prior to the agreed migration window, service teams must:
41+
42+
* Review the AWS Config pre-migration assessment provided by the Platform Engineering team
43+
* Review the Workloads OU SCP guardrails documented in the Digital Handbook
44+
* Confirm availability from the service team during the migration window
45+
* Ensure no major releases are scheduled during the migration window
46+
* Be prepared to pause CI/CD pipelines if requested
47+
48+
No further action is required unless specific risks have been identified during the pre-migration assessment.
49+
50+
### 2. At the Start of the Migration Window
51+
52+
At the beginning of the agreed migration window:
53+
54+
* Service teams will be added to a **temporary migration contact group**
55+
* The group will include a member of the **Platform Engineering team (COPE)** and a member of the **Cyber Security team**
56+
57+
Service teams will be asked to:
58+
59+
* Pause CI/CD pipelines
60+
* Confirm no active deployments are in progress
61+
* Confirm no live production incidents are ongoing
62+
63+
Once these checks are complete, the Platform Engineering team will proceed with the **OU migration**.
64+
65+
The OU move itself typically completes within a few minutes.
66+
67+
### 3. Immediately After Migration (First 60 Minutes)
68+
69+
Following the OU move, service teams should validate:
70+
71+
* Application health
72+
* Deployment capability
73+
* Cross-account access (if applicable)
74+
* End-user access
75+
76+
If everything functions as expected, no further action is required.
77+
78+
If any of the following issues are observed:
79+
80+
* `AccessDenied` errors
81+
* Failed deployments
82+
* Region-related errors
83+
* Unexpected permission issues
84+
85+
These should be raised immediately in the migration contact group.
86+
87+
The Platform Engineering team will assess the issue and determine the appropriate next steps.
88+
89+
### 4. If Issues Occur
90+
91+
If a **critical service impact** is identified during the migration window:
92+
93+
* The Platform Engineering team may initiate a **rollback to the Transition OU**
94+
* Findings will be reviewed collaboratively with the service team
95+
* A revised migration plan will be agreed if required
96+
97+
Service teams are **not expected to modify SCPs or implement policy changes directly**.
98+
99+
### 5. After Migration
100+
101+
Within 24 hours of the migration:
102+
103+
* Confirm continued service stability
104+
* Resume normal deployment activity
105+
106+
No further action is required unless contacted by the Platform Engineering team.
107+
108+
## Roles and Responsibilities
109+
110+
**Service Teams** are responsible for:
111+
112+
* Reviewing the pre-migration assessment
113+
* Ensuring availability during the migration window
114+
* Pausing deployments when requested
115+
* Validating service health following migration
116+
* Reporting any issues during the migration window
117+
118+
The **Platform Engineering Team (COPE)** is responsible for:
119+
120+
* Coordinating the migration window
121+
* Performing the OU move
122+
* Assessing issues raised during the migration
123+
* Initiating rollback if required
124+
125+
The **Cyber Security Team** is responsible for:
126+
127+
* Supporting migration monitoring
128+
* Providing security oversight during the migration process
129+
130+
## What This Migration Does Not Require
131+
132+
Service teams are **not required to**:
133+
134+
* Redesign their architecture
135+
* Refactor CI/CD pipelines
136+
* Remove IAM users
137+
* Make proactive changes unless specifically discussed during the assessment
138+
139+
Most accounts migrate to the Workloads OU without impact.

0 commit comments

Comments
 (0)