Skip to content

Runner heartbeat and monitoring#144

Merged
danielholanda merged 5 commits intomainfrom
vgodsoe/runner-heartbeats
Mar 16, 2026
Merged

Runner heartbeat and monitoring#144
danielholanda merged 5 commits intomainfrom
vgodsoe/runner-heartbeats

Conversation

@vgodsoe
Copy link
Copy Markdown
Collaborator

@vgodsoe vgodsoe commented Mar 16, 2026

This PR adds 2 workflows to the repo:

  1. runner-heartbeat.yml - this workflow will ping each self-hosted runner in order to check that 1) they're alive still and 2) to ensure that a workflow has run on the runner within the last 14 days (this is GitHub's retention limit for self-hosted runners, after 14 days, the runner is automatically removed). At the end of the workflow, an artifact for each runner that was successful is uploaded.
  2. monitor-runners.yml - this workflow looks for the uploaded artifacts. If any runner's artifact is missing, a Teams message is sent to notify the team that we need to check on a runner.

The list of runners needs to be manually updated whenever a new runner is added.

Future work - once IT/AMD GitHub Admin approves the token, the workflows should be updated to automatically pull the list of available self-hosted runners that should be tested.

@vgodsoe vgodsoe self-assigned this Mar 16, 2026
@vgodsoe vgodsoe requested a review from danielholanda March 16, 2026 19:28
@danielholanda danielholanda merged commit 772994a into main Mar 16, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants