
Drastic drop in high frequency alerts

Open boxinegmbh opened this issue 1 year ago • 1 comments

Self-Hosted Version

24.1.1

CPU Architecture

x86_64

Docker Version

25.0.0

Docker Compose Version

1.29.2

Steps to Reproduce

Upgrade from 23.3.1 to 24.1.1. A full Docker volume backup was done. The restore was successful.

Expected Result

Uninterrupted functionality of the configured alerts.

Actual Result

After the upgrade, there was a steep decline in triggered alerts. We tried setting up the alerting behaviour again, but no change was observed.

Screenshot from 2024-02-06 16-35-42

Alert settings: Screenshot from 2024-02-06 17-17-25

Event ID

No response

boxinegmbh avatar Feb 06 '24 15:02 boxinegmbh

Did you do the hard stop at 23.6.2 when upgrading?

azaslavsky avatar Feb 07 '24 18:02 azaslavsky

This issue has gone three weeks without activity. In another week, I will close it.

But! If you comment or otherwise update it, I will reset the clock, and if you remove the label Waiting for: Community, I will leave it alone ... forever!


"A weed is but an unloved flower." ― Ella Wheeler Wilcox 🥀

getsantry[bot] avatar Feb 29 '24 08:02 getsantry[bot]

We upgraded in four steps. First, from v23.3.1 to v23.4.0 to manage the Postgres database upgrade. Then from v23.4.0 to v23.9.1, and did the full Docker volume backup. Then from v23.9.1 to v23.11.1, using hardware with the recommended requirements. Finally, from v23.11.1 to v24.1.1, which is when we saw this issue.

We performed the recovery steps suggested in the docs: https://develop.sentry.dev/self-hosted/troubleshooting/#recovery

We did not perform the nuclear option, since we wanted to recover all issues with no or minimal data loss.

What exactly is the hard stop?

boxinegmbh avatar Feb 29 '24 09:02 boxinegmbh

A hard stop is a version of self-hosted Sentry that must be upgraded to before continuing. Are there any logs in your web container that might tell us what is going on? It looks like alerts are not being triggered.
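
To make the hard-stop idea concrete, here is a minimal Python sketch (not part of Sentry's tooling) that checks an upgrade path against a list of hard stops. The function names and the `HARD_STOPS` list are hypothetical; the only hard stop named in this thread is 23.6.2, so consult the self-hosted release documentation for the authoritative list.

```python
from typing import List, Tuple

# Assumption: only 23.6.2 is named in this thread as a hard stop.
# The real list lives in the self-hosted documentation.
HARD_STOPS = ["23.6.2"]


def parse(version: str) -> Tuple[int, ...]:
    """Turn '23.6.2' or 'v23.6.2' into a comparable tuple such as (23, 6, 2)."""
    return tuple(int(part) for part in version.lstrip("v").split("."))


def plan_upgrade(current: str, target: str,
                 hard_stops: List[str] = HARD_STOPS) -> List[str]:
    """Return the versions to install, in order, ending with the target."""
    lo, hi = parse(current), parse(target)
    if hi < lo:
        raise ValueError("target must not be older than current")
    # Every hard stop strictly between current and target must be visited.
    stops = sorted((s for s in hard_stops if lo < parse(s) < hi), key=parse)
    return stops + [target]


def skipped_stops(path: List[str],
                  hard_stops: List[str] = HARD_STOPS) -> List[str]:
    """Given the versions installed in order, list hard stops that were jumped over."""
    skipped = []
    for before, after in zip(path, path[1:]):
        for stop in hard_stops:
            if parse(before) < parse(stop) < parse(after):
                skipped.append(stop)
    return skipped


print(plan_upgrade("23.3.1", "24.1.1"))
# ['23.6.2', '24.1.1']
print(skipped_stops(["v23.3.1", "v23.4.0", "v23.9.1", "v23.11.1", "v24.1.1"]))
# ['23.6.2']
```

Applied to the path reported above, the step from v23.4.0 to v23.9.1 crosses 23.6.2 without stopping there.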

hubertdeng123 avatar Mar 01 '24 22:03 hubertdeng123

After investigation, it seems that Sentry was amplifying the impact of errors (impacting 50% of sessions) before the upgrade; the drastic drop was observed after it. The number of errors did not change drastically between before and after the upgrade. Our current assumption is that the alerts based on the rate of sessions were wrong before the upgrade to v24.1.1.

boxinegmbh avatar Mar 05 '24 09:03 boxinegmbh