Increase backup wait time and delete timeouts in node-backup #3662

OriolMunoz-da · 2026-01-23T09:42:19Z

Possibly fixes https://github.com/DACH-NY/cn-test-failures/issues/7017

Signed-off-by: Oriol Muñoz <oriol.munoz@digitalasset.com>

martinflorian-da · 2026-01-23T09:52:56Z

cluster/scripts/node-backup.sh

      break
    else
-      (( i++ ))&& (( i > 300 )) &&_error "Timed out waiting for backup of $description db"
+      (( i++ ))&& (( i > WAIT_FOR_BACKUP_RETRIES )) && {


Did this time out too?

No, but I'd rather mirror the behavior between both waits (db and PVC).

I like the consistency, but sentences that contain both "delete" and "backup" always make me scared and make me look more closely...

martinflorian-da · 2026-01-23T09:56:56Z

cluster/scripts/node-backup.sh

    else
-      (( i++ )) && (( i > 300 )) && _error "Timed out waiting for backup of $description PVC"
+      (( i++ )) && (( i > WAIT_FOR_BACKUP_RETRIES )) && {
+        kubectl delete volumesnapshot -n "$namespace" "$backupName";


How do we know we're allowed to delete here at all? If an operation is running then this will just fail too, won't it? But I guess that's fine because then we'll still retry?

> then we'll still retry

Hm or will this script just die, because we're not capturing the error code of the kubectl delete?

hopefully it works and it gets deleted, but otherwise you're correct, it will exit 1 and retry

Hm and assuming the delete never works... will we loop forever here?

_error has an exit 1 inside, so either kubectl delete fails or _error exits

I guess it does... so fine then, not vetoing if you feel confident about this.
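
For reference, a minimal sketch of the control flow discussed in this thread, not the actual node-backup.sh. `WAIT_FOR_BACKUP_RETRIES` is the name from the diff; its value, the `_error` body (assumed to print and `exit 1`, as described above), and the `failing_delete`, `wait_for_backup`, and `run_variant` helpers are hypothetical stand-ins. Whether the real script runs under `set -e` is not visible here, so the sketch tries both.

```shell
#!/usr/bin/env bash
# Sketch only. The value below and all function bodies are assumptions.
WAIT_FOR_BACKUP_RETRIES=3

_error() {            # assumed shape: print a message, then exit 1
  echo "ERROR: $*" >&2
  exit 1
}

failing_delete() {    # hypothetical stand-in for a `kubectl delete` that fails
  return 1
}

wait_for_backup() {
  local i=0
  while true; do
    # The backup never becomes ready in this sketch, so every iteration
    # takes the retry branch.
    (( i++ )) && (( i > WAIT_FOR_BACKUP_RETRIES )) && {
      failing_delete
      _error "Timed out waiting for backup"
    }
  done
}

# Run the loop in a subshell with errexit on (-e) or off (+e), then report
# the subshell's exit status together with anything it printed.
run_variant() {
  ( set "$1"; wait_for_backup ) 2>&1
  echo "exit=$?"
}

out_without_errexit=$(run_variant +e)
out_with_errexit=$(run_variant -e)

printf 'without errexit:\n%s\n\nwith errexit:\n%s\n' \
  "$out_without_errexit" "$out_with_errexit"
```

In both variants the subshell ends with a non-zero status once the retry budget is exceeded. With errexit, the failed delete ends the script before `_error` runs; without it, `_error` exits. Under these assumptions the loop cannot spin forever.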

martinflorian-da

Thanks!

WDYT about trying it without the deletes first? Not sure how much we're gaining for the risks here (of this not working as expected, of this deleting some other backup accidentally (low risk, I know))

OriolMunoz-da added 2 commits January 23, 2026 09:35

bump backup wait retries

548a789

Signed-off-by: Oriol Muñoz <oriol.munoz@digitalasset.com>

[static] delete backup on failure

4062ce2

Signed-off-by: Oriol Muñoz <oriol.munoz@digitalasset.com>

OriolMunoz-da requested review from isegall-da and martinflorian-da January 23, 2026 09:42

martinflorian-da reviewed Jan 23, 2026

martinflorian-da approved these changes Jan 23, 2026

[static] remove the deletes for now for safety

601f8e9

Signed-off-by: Oriol Muñoz <oriol.munoz@digitalasset.com>

OriolMunoz-da enabled auto-merge (squash) January 23, 2026 10:04

OriolMunoz-da merged commit 925dcf9 into main Jan 23, 2026
44 checks passed

OriolMunoz-da deleted the oriol/reset-backup-timeouts branch January 23, 2026 10:12

isegall-da added a commit that referenced this pull request Jan 23, 2026

[static] Revert "Increase backup wait time and delete timeouts in nod…

c48c458

…e-backup (#3662)" This reverts commit 925dcf9. Signed-off-by: Itai Segall <itai.segall@digitalasset.com>

isegall-da mentioned this pull request Jan 23, 2026

Revert "Increase backup wait time and delete timeouts in nod… #3674

Merged

6 tasks

isegall-da added a commit that referenced this pull request Jan 23, 2026

[static] Revert "Increase backup wait time and delete timeouts in nod…

bb9e8d5

…e-backup (#3662)" (#3674) This reverts commit 925dcf9. Signed-off-by: Itai Segall <itai.segall@digitalasset.com>
