Commit ff5acf3
committed
fix(smoke): treat UPDATE_ROLLBACK_COMPLETE as needs-recreate, not deployable
Post-merge smoke on main found all-demo in UPDATE_ROLLBACK_COMPLETE,
called use_canonical, and the subsequent CFN update failed because the
internal rollback (triggered by a leaf failure) tried to delete the
StorageFileSystem61EA7B3D — which has pending S3 export data and needs
forceDelete=true. The AWS::S3Files::FileSystem CFN handler does not
pass forceDelete, so the rollback fails and the deploy reports
"Failed to create/update the stack".
CFN technically accepts updates from UPDATE_ROLLBACK_COMPLETE, but for
the all-demo umbrella that state always hides this kind of half-cleaned
S3Files / nested-stack debris. Safer to mirror the ROLLBACK_COMPLETE
branch: delete-stack, wait, retain-on-DELETE_FAILED, then `continue`
the loop so the next iteration hits DOES_NOT_EXIST → use_canonical →
full resource sweep (which DOES force-delete file systems).
Cost: one extra ~60m recreate cycle when CFN rolled back. Benefit: the
umbrella self-recovers from S3Files-stuck rollbacks instead of needing
human cleanup.1 parent 47c3fba commit ff5acf3
1 file changed
Lines changed: 30 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
323 | 323 | | |
324 | 324 | | |
325 | 325 | | |
326 | | - | |
| 326 | + | |
327 | 327 | | |
328 | 328 | | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
| 353 | + | |
| 354 | + | |
| 355 | + | |
| 356 | + | |
| 357 | + | |
329 | 358 | | |
330 | 359 | | |
331 | 360 | | |
| |||
0 commit comments