arcade
arcade copied to clipboard
Production - [Alerting] Apple simulator failure rate alert
:broken_heart: Metric state changed to alerting
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
- FailureRate {Machine=dci-mac-build-057} 83
@dotnet/dnceng, please investigate
Automation information below, do not change
Grafana-Automated-Alert-Id-36d07fceeaf0472b804d8358b2198eac
:broken_heart: Metric state changed to alerting
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
- FailureRate {Machine=dci-mac-build-045} 100
- FailureRate {Machine=dci-mac-build-054} 100
After discussion with @premun, decided to exclude app failure (ExitCode 80
) from the alert as its not an infrastructure issue. PR for the change -> https://dev.azure.com/dnceng/internal/_git/dotnet-helix-service/pullrequest/25729
:broken_heart: Metric state changed to alerting
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
- FailureRate {Machine=dci-mac-build-008} 100
- FailureRate {Machine=dci-mac-build-045} 100
:green_heart: Metric state changed to ok
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
:broken_heart: Metric state changed to alerting
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
- FailureRate {Machine=dci-mac-build-009} 100
:broken_heart: Metric state changed to alerting
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
- FailureRate {Machine=dci-mac-build-009} 100
:green_heart: Metric state changed to ok
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
:green_heart: Metric state changed to ok
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.