ATO-2173: setup spot request queue alarms#7824

Open
isaac-GDS wants to merge 1 commit into main from ATO-2173/spot-request-queue-alarms

@isaac-GDS
Contributor

Wider context of change

Move spot request queue into Orch

What’s changed

Added alarms:
 - oldest message has been in the queue for more than 10 seconds (2nd line)
 - oldest message has been in the queue for more than 60 seconds (PagerDuty)
 - a message is in the DLQ (2nd line)

If not in prod, alarms go to a Slack channel instead.
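
The alarms described above could look roughly like the following CloudFormation fragment. This is a sketch under assumptions, not the contents of this PR's `template.yaml`: the logical IDs `SpotRequestQueue`, `SpotRequestDeadLetterQueue`, `SpotRequestQueuePagerDutyAlarm`, `SpotRequestDeadLetterQueueAlarm`, the `PagerDutyEvents` topic, the `IsProduction` condition, the second and third alarm names, and the `production` environment value are all hypothetical. `SlackEvents` and `SecondLineSlackEvents` are assumed to be SNS topics defined elsewhere in the template.

```yaml
# Sketch only: see the hypothetical names listed above.
Parameters:
  Environment:
    Type: String

Conditions:
  # Assumes the production environment is named "production".
  IsProduction: !Equals [!Ref Environment, production]

Resources:
  # Oldest message older than 10 seconds: second line (Slack outside prod).
  SpotRequestQueueSecondLineAlarm:
    Type: AWS::CloudWatch::Alarm
    Properties:
      AlarmName: !Sub ${Environment}-spot-request-queue-alarm
      Namespace: AWS/SQS
      MetricName: ApproximateAgeOfOldestMessage
      Dimensions:
        - Name: QueueName
          Value: !GetAtt SpotRequestQueue.QueueName
      Statistic: Maximum
      Period: 60
      EvaluationPeriods: 1
      Threshold: 10
      ComparisonOperator: GreaterThanThreshold
      TreatMissingData: notBreaching
      AlarmActions:
        - !If [IsProduction, !Ref SecondLineSlackEvents, !Ref SlackEvents]

  # A message is visible on the dead-letter queue: second line (Slack outside prod).
  SpotRequestDeadLetterQueueAlarm:
    Type: AWS::CloudWatch::Alarm
    Properties:
      AlarmName: !Sub ${Environment}-spot-request-dlq-alarm
      Namespace: AWS/SQS
      MetricName: ApproximateNumberOfMessagesVisible
      Dimensions:
        - Name: QueueName
          Value: !GetAtt SpotRequestDeadLetterQueue.QueueName
      Statistic: Maximum
      Period: 60
      EvaluationPeriods: 1
      Threshold: 0
      ComparisonOperator: GreaterThanThreshold
      TreatMissingData: notBreaching
      AlarmActions:
        - !If [IsProduction, !Ref SecondLineSlackEvents, !Ref SlackEvents]

  # Oldest message older than 60 seconds: PagerDuty (Slack outside prod).
  SpotRequestQueuePagerDutyAlarm:
    Type: AWS::CloudWatch::Alarm
    Properties:
      AlarmName: !Sub ${Environment}-spot-request-queue-pagerduty-alarm
      Namespace: AWS/SQS
      MetricName: ApproximateAgeOfOldestMessage
      Dimensions:
        - Name: QueueName
          Value: !GetAtt SpotRequestQueue.QueueName
      Statistic: Maximum
      Period: 60
      EvaluationPeriods: 1
      Threshold: 60
      ComparisonOperator: GreaterThanThreshold
      TreatMissingData: notBreaching
      AlarmActions:
        - !If [IsProduction, !Ref PagerDutyEvents, !Ref SlackEvents]
```

Routing with `!If` inside `AlarmActions` is only one way to split prod from non-prod notifications. The actual template may instead attach several topics to each alarm.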

Manual testing

To do: test in dev

isaac-GDS requested review from a team as code owners on February 13, 2026, 16:07
@github-actions

Java Tests Skipped

No Java files were changed in this pull request. Java tests will be skipped [1].

Any Java files that are changed in a subsequent commit will trigger the Java tests.

Footnotes

  1. These tests will still show as passing in the PR status check, but will not actually have run.

cearl1 previously approved these changes Feb 19, 2026
template.yaml Outdated
      Threshold: 10
      EvaluationPeriods: 1

  SpotRequestQueueSecondLineAlarm:

Minor nit, but could we move this alarm above the PagerDuty one, so that the alarms get more severe as you read downwards?

template.yaml Outdated
        - !Ref SlackEvents
        - !Ref SecondLineSlackEvents
      AlarmName: !Sub ${Environment}-spot-request-queue-alarm
      AlarmDescription: !Sub "Age of the oldest message on ${Environment} sqs spot request queue exceeds 10 seconds.ACCOUNT: di-orchestration-${Environment}. Runbook: https://govukverify.atlassian.net/wiki/x/VIFoCAE"

Another minor nit: could we make this alarm description match the format of the PagerDuty one? There are some slight differences.
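
One difference visible in the quoted line is the missing space between `seconds.` and `ACCOUNT:`. The PagerDuty alarm's description is not shown here, so the following is only a sketch of a normalised version, assuming that description separates its sentences with a single space:

```yaml
# Sketch only: same text as the quoted line, with a space added before "ACCOUNT:".
AlarmDescription: !Sub "Age of the oldest message on ${Environment} sqs spot request queue exceeds 10 seconds. ACCOUNT: di-orchestration-${Environment}. Runbook: https://govukverify.atlassian.net/wiki/x/VIFoCAE"
```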

 - pagerDutyAlarm
 - dlq 2ndline alarm
 - 2ndline alarm