arcade
Production - [Alerting] Android devices disconnected
:broken_heart: Metric state changed to alerting
Description and instructions for this alert
Please note that this alert will fire every 12 hours as the list of machines can change while the alert is alive. So please keep an eye on the list of machines in the comment.
- FailureRate {Machine=DNCENGWIN-055} 100
@dotnet/dnceng, please investigate
Automation information below, do not change
Grafana-Automated-Alert-Id-35f560112f7a4bfabf9fd69bc1bd76fa
An IcM ticket for the machine already exists: https://portal.microsofticm.com/imp/v3/incidents/details/334311616/home
I also disabled the machine.
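Because the machine list can change between firings, it can help to compare consecutive alert comments. The following is a minimal sketch, assuming the alert lines keep the `- FailureRate {Machine=...} N` shape seen in this thread; `parse_alert` and `diff_alerts` are hypothetical helper names, not part of any existing tooling.

```python
import re

# Matches alert lines such as "- FailureRate {Machine=DNCENGWIN-055} 100".
# Assumption: the bot keeps emitting lines in exactly this shape.
LINE_RE = re.compile(r"FailureRate \{Machine=([^}]+)\}\s+(\d+)")


def parse_alert(comment: str) -> dict[str, int]:
    """Return {machine name: failure rate} for one alert comment."""
    return {m.group(1): int(m.group(2)) for m in LINE_RE.finditer(comment)}


def diff_alerts(previous: dict[str, int], current: dict[str, int]):
    """Return (added, removed) machine names, each sorted alphabetically."""
    added = sorted(set(current) - set(previous))
    removed = sorted(set(previous) - set(current))
    return added, removed


prev = parse_alert("- FailureRate {Machine=DNCENGWIN-055} 100")
curr = parse_alert(
    "- FailureRate {Machine=DNCENGWIN-055} 100\n"
    "- FailureRate {Machine=DNCENGWIN-116} 100"
)
print(diff_alerts(prev, curr))  # (['DNCENGWIN-116'], [])
```

Machines in `added` are the ones that may need a fresh look; machines in `removed` have dropped off the alert since the previous firing.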
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-055} 100
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-055} 100
- FailureRate {Machine=DNCENGWIN-116} 100
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-055} 100
- FailureRate {Machine=DNCENGWIN-078} 94
- FailureRate {Machine=DNCENGWIN-116} 93
Disabled DNCENGWIN-116 and DNCENGWIN-078, and created an IcM ticket for them: https://portal.microsofticm.com/imp/v3/incidents/details/334893393/home
Re-enabled DNCENGWIN-055, as the IcM ticket for it was successfully handled.
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-055} 100
- FailureRate {Machine=DNCENGWIN-078} 100
- FailureRate {Machine=DNCENGWIN-116} 85
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-022} 85
- FailureRate {Machine=DNCENGWIN-078} 94
- FailureRate {Machine=DNCENGWIN-116} 85
Disabled DNCENGWIN-022 and added it to the already open IcM ticket.
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-022} 92
- FailureRate {Machine=DNCENGWIN-066} 90
- FailureRate {Machine=DNCENGWIN-078} 100
- FailureRate {Machine=DNCENGWIN-116} 85
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-022} 92
- FailureRate {Machine=DNCENGWIN-066} 94
- FailureRate {Machine=DNCENGWIN-078} 94
- FailureRate {Machine=DNCENGWIN-116} 85
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-022} 92
- FailureRate {Machine=DNCENGWIN-078} 100
- FailureRate {Machine=DNCENGWIN-116} 85
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-022} 92
- FailureRate {Machine=DNCENGWIN-116} 83
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-022} 92
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-022} 92
:green_heart: Metric state changed to ok
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-075} 100
:green_heart: Metric state changed to ok
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-058} 100
- FailureRate {Machine=DNCENGWIN-120} 100
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-058} 92
- FailureRate {Machine=DNCENGWIN-120} 96
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-006} 100
- FailureRate {Machine=DNCENGWIN-051} 88
- FailureRate {Machine=DNCENGWIN-120} 93
After having a look in Kusto, DNCENGWIN-006 doesn't appear to be broken; it just failed the most recent few work items. Same story for DNCENGWIN-051. DNCENGWIN-120 does appear to be broken, so I will take it offline and create an IcM ticket.
IcM: https://portal.microsofticm.com/imp/v3/incidents/details/336719202/home
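The distinction drawn above, between a machine that only failed its last few work items and one that is persistently failing, can be sketched as a simple heuristic. This is an illustration only: the `classify` function, the `recent` window, and the `threshold` value are all assumptions, and the real triage here was done by inspecting work item history in Kusto.

```python
def classify(outcomes: list[bool], recent: int = 5, threshold: float = 0.8) -> str:
    """Roughly classify a machine from its work item history.

    outcomes: oldest first; True means the work item passed.
    recent and threshold are hypothetical tuning values.
    """
    if not outcomes:
        return "no data"
    failures = [not ok for ok in outcomes]
    overall_rate = sum(failures) / len(failures)
    if overall_rate >= threshold:
        # Failing across the whole window, not just at the end.
        return "likely broken"
    if all(failures[-recent:]):
        # Only the most recent work items failed.
        return "recent failures only"
    return "healthy"


print(classify([True] * 20 + [False] * 5))  # recent failures only
print(classify([False] * 19 + [True]))      # likely broken
```

A high failure rate in the alert is therefore a prompt to look at the longer history before disabling a machine.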
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-006} 100
- FailureRate {Machine=DNCENGWIN-051} 90
- FailureRate {Machine=DNCENGWIN-058} 83
- FailureRate {Machine=DNCENGWIN-120} 94
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-006} 83
- FailureRate {Machine=DNCENGWIN-120} 94
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-120} 94
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-006} 100
- FailureRate {Machine=DNCENGWIN-120} 94
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-006} 100
- FailureRate {Machine=DNCENGWIN-120} 94
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-006} 100
:green_heart: Metric state changed to ok
:broken_heart: Metric state changed to alerting
- FailureRate {Machine=DNCENGWIN-006} 100