-
Notifications
You must be signed in to change notification settings - Fork 17
Alerts for Merkle Root Upload and Claims #176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
tip-router-operator-cli/src/claim.rs
Outdated
| .await?; | ||
|
|
||
| if validators_processed { | ||
| let epoch_percentage = epoch_percentage::get(&rpc_client).await.unwrap_or(0f64); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe we'd want to not emit in the event of the unwrap failing so that we're not erroneously writing 0% in the event of a 429 or timeout. Thoughts?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm with that. Best to avoid erroneous alerts.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| @@ -0,0 +1,10 @@ | |||
| use anyhow::Result; | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: would prefer this as just a helper in lib.rs than adding another file
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added a new "epoch_percentage" field (already used elsewhere), in creating new alerts for Merkle Root Uploads and MEV/PF Claims.
Three new alerts:
"Tip Router -- Merkle Root Upload Timer" https://jitolabs.grafana.net/alerting/grafana/beub9k1y9dfcwd/view
Alerts if the number of failed MR uploads has not reached zero after 10% of the epoch.
"Tip Router -- Claims Timer" https://jitolabs.grafana.net/alerting/grafana/beub9k1y9dfcwd/view
Alerts if the number of remaining claims has not reached zero after 33% of the epoch.
"Tip Router -- Claims Have Stalled" https://jitolabs.grafana.net/alerting/grafana/fetnet1krxatcf/view
Checks the num remaining claims now vs an hour ago, and alerts if those values are the same.