sentry icon indicating copy to clipboard operation
sentry copied to clipboard

‘Number of Errors’ Alert in Critical state for over a year

Open kpujjigit opened this issue 1 year ago • 1 comments

Environment

SaaS (https://sentry.io/)

Steps to Reproduce

It appears this Number of Errors Alert

Image

Image

Expected Result

State of Alert should change, and we in-fact see a change in state when we go to Edit the rule:

Image

Actual Result

Alert has been in a critical state since it was created a year ago; insight was lost for a client and spikes occurred without alerting to proper channels.

Product Area

Alerts

Link

No response

DSN

No response

Version

No response

kpujjigit avatar Aug 26 '24 23:08 kpujjigit

Auto-routing to @getsentry/product-owners-alerts for triage ⏲️

getsantry[bot] avatar Aug 26 '24 23:08 getsantry[bot]

@kpujjigit Hey there, could you link to the org and alert links somewhere? I don't see them on the issue rn

schew2381 avatar Aug 28 '24 17:08 schew2381

@kpujjigit We are still missing the link to org/alert for us to debug this.

leedongwei avatar Sep 03 '24 18:09 leedongwei

I was able to find the alert with a query. From looking at logs, the alert itself was setup in a way that it could never resolve.

The resolve threshold is set to resolve when # of errors is 100% lower than 1 week ago, but this means the number of errors at the current moment must essentially reach 0 in order resolve.

For example, we would reach this number very close to 100% if 1 week ago we had 9991 errors, and in the current moment we have 1

Image

The alert was resolved by editing it, but it still has the same resolve issue so once it triggers it will never resolve as well.

schew2381 avatar Sep 03 '24 18:09 schew2381