
[kube-prometheus-stack] Unable to get Grafana alerts working

Open mymasse opened this issue 3 years ago • 21 comments

Describe the bug

I am unable to get any Grafana alerts sent to contact points. I created both a Slack and an MS Teams contact point, set one as the root policy's contact point, and created an alert rule.

Nothing gets sent to contact points.

What's your helm version?

version.BuildInfo{Version:"v3.7.0", GitCommit:"eeac83883cb4014fe60267ec6373570374ce770b", GitTreeState:"clean", GoVersion:"go1.16.8"}

What's your kubectl version?

Client Version: version.Info{Major:"1", Minor:"21", GitVersion:"v1.21.0", GitCommit:"cb303e613a121a29364f75cc67d3d580833a7479", GitTreeState:"clean", BuildDate:"2021-04-08T16:31:21Z", GoVersion:"go1.16.1", Compiler:"gc", Platform:"linux/amd64"}
Server Version: version.Info{Major:"1", Minor:"21", GitVersion:"v1.21.2", GitCommit:"092fbfbf53427de67cac1e9fa54aaa09a28371d7", GitTreeState:"clean", BuildDate:"2021-06-16T12:53:14Z", GoVersion:"go1.16.5", Compiler:"gc", Platform:"linux/amd64"}

Which chart?

kube-prometheus-stack

What's the chart version?

19.0.1

What happened?

Nothing is sent to my contact points

What you expected to happen?

Alerts are sent to contact points

How to reproduce it?

  • Create a Slack contact point with a webhook
  • Make that contact point the root policy one and group by alertname
  • Create an alert rule with vector(1) > 0
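Before blaming Grafana, it may help to rule out the webhook itself. The following is only a rough sketch: the function names and the `SLACK_WEBHOOK_URL` variable are hypothetical, and it assumes a standard Slack incoming webhook that accepts a JSON body with a `text` field.

```shell
# Sketch: post a test message straight to a Slack incoming webhook,
# bypassing Grafana entirely. Function names here are illustrative only.

# slack_payload TEXT: print a minimal JSON body. TEXT is not escaped, so it
# must not contain double quotes or backslashes.
slack_payload() {
  printf '{"text": "%s"}' "$1"
}

# send_slack_test URL: POST the payload to URL; curl -f makes HTTP errors
# produce a non-zero exit status.
send_slack_test() {
  curl -fsS -X POST -H 'Content-Type: application/json' \
    --data "$(slack_payload 'Test message sent outside Grafana')" "$1"
}

# Example (SLACK_WEBHOOK_URL is a placeholder for your own webhook):
#   send_slack_test "$SLACK_WEBHOOK_URL"
```

If the test message arrives in Slack, the webhook is fine and the problem is more likely on the Grafana side.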

Enter the changed values of values.yaml?

grafana:
  grafana.ini:
    feature_toggles:
      enable: ngalert

Enter the command that you execute and failing/misfunctioning.

minikube start --kubernetes-version=v1.21.2
helm install -f value.yml prom prometheus/kube-prometheus-stack
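To confirm that the `ngalert` toggle from the values file actually reached the rendered Grafana configuration, something like the sketch below could be used. The helper name `has_ngalert` is made up for illustration, and the ConfigMap name `prom-grafana` is an assumption derived from the release name `prom`; adjust it to whatever your release created.

```shell
# Sketch: check a grafana.ini (read from stdin) for the ngalert feature toggle.

# has_ngalert: succeeds when the [feature_toggles] section contains an
# `enable` key whose value lists `ngalert` (space- or comma-separated).
has_ngalert() {
  awk '
    # Track whether the current section is [feature_toggles].
    /^[ \t]*\[/ { in_section = ($0 ~ /^[ \t]*\[feature_toggles\]/) }
    in_section && /^[ \t]*enable[ \t]*=/ {
      sub(/^[^=]*=/, "")                      # keep only the value
      n = split($0, toggles, /[ \t,]+/)
      for (i = 1; i <= n; i++) if (toggles[i] == "ngalert") found = 1
    }
    END { exit(found ? 0 : 1) }
  '
}

# Example (assumes the ConfigMap is named "prom-grafana"):
#   kubectl get configmap prom-grafana -o jsonpath='{.data.grafana\.ini}' \
#     | has_ngalert && echo "ngalert is enabled"
```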

Anything else we need to know?

No response

mymasse avatar Oct 05 '21 19:10 mymasse

@mymasse it's a bug in Grafana < 8.1.5. I'm also waiting for the chart to update to the latest Grafana so I can test whether it's fixed.
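A quick way to tell whether a given deployment falls in the affected range is to compare its Grafana version against 8.1.5. This is a minimal sketch, assuming GNU coreutils (`sort -V`) is available; the function names and the deployment name `prom-grafana` (derived from the release name `prom` used in this issue) are assumptions.

```shell
# Sketch: decide whether a Grafana version predates 8.1.5.
# Assumes GNU coreutils (sort -V) is available.

# version_lt A B: succeeds when version A sorts strictly before version B.
version_lt() {
  [ "$1" != "$2" ] &&
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$1" ]
}

# grafana_affected VERSION: succeeds when VERSION is older than 8.1.5.
grafana_affected() {
  version_lt "$1" "8.1.5"
}

# Example (hypothetical deployment name "prom-grafana"):
#   kubectl get deployment prom-grafana \
#     -o jsonpath='{.spec.template.spec.containers[*].image}'
#   grafana_affected 8.1.2 && echo "affected"
```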

gaetansnl avatar Oct 12 '21 08:10 gaetansnl

It seems the newer chart fixed part of the problem. Alerts defined in Grafana are now sent to contact points. However, none of the alerts in the Cortex/Loki group are working, for instance these:

image

I think these came pre-bundled with the chart. They show up in Alertmanager, but apart from the Alert Rules panel they do not show up in the Alert Groups when they are in the Firing state, and they are not propagated to contact points.

mymasse avatar Oct 25 '21 12:10 mymasse

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Any further update will cause the issue/pull request to no longer be considered stale. Thank you for your contributions.

stale[bot] avatar Nov 24 '21 17:11 stale[bot]

Keeping this open until I can test it with the latest chart version

mymasse avatar Nov 24 '21 17:11 mymasse

Thanks for raising this @mymasse.

I can confirm that the Loki-generated alerts unfortunately still don't go through to the defined contact point (in my case, Alertmanager) as of the latest chart version, 23.1.6.

I hope this gets resolved soon. There have been quite a few chart versions since the bug was opened, and it can be a bit of a roadblock.

CEMario avatar Dec 10 '21 18:12 CEMario

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Any further update will cause the issue/pull request to no longer be considered stale. Thank you for your contributions.

stale[bot] avatar Jan 09 '22 22:01 stale[bot]

This issue is being automatically closed due to inactivity.

stale[bot] avatar Jan 23 '22 22:01 stale[bot]

Still interested in resolution of this bug so Loki alerts can be used

CEMario avatar Jan 27 '22 15:01 CEMario

Can this be reopened? I can't do it myself.

mymasse avatar Jan 27 '22 15:01 mymasse

I am also interested in the resolution of this issue. Please reopen again.

The versions in our system are:

  • kube-prometheus-stack: 32.2.0
  • prometheus: 2.33.1
  • grafana: 8.3.6
  • alertmanager: 0.23.0
  • prometheus-operator: 0.54.0

raebbar avatar Feb 14 '22 12:02 raebbar

I'm interested in a resolution too. Please re-open this.

Andrea-Gallicchio avatar Feb 22 '22 15:02 Andrea-Gallicchio

Plus 1 for a resolution. Is there a workaround or something? I'm using the latest version.

milenbm avatar Mar 22 '22 08:03 milenbm

Found this statement on this issue from one of the Grafana team members: https://github.com/grafana/grafana/discussions/45773#discussioncomment-2235421

tldr: Not possible atm, but might be in the future :(

J12934 avatar Apr 08 '22 15:04 J12934

+1

I am also searching for a resolution. I'm using the latest version

Could it be that these alerts are managed by Alertmanager? My Alertmanager shows Cluster Status: disabled under http://localhost:9093/#/status, and I am not able to enable it. I found the following Stack Overflow question describing my issue, but I don't know where I can enable this option: https://stackoverflow.com/questions/70470152/alertmanager-cluster-status-is-disabled

If this is unrelated I can open a new issue for this.

dickescheid avatar Apr 21 '22 15:04 dickescheid

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Any further update will cause the issue/pull request to no longer be considered stale. Thank you for your contributions.

stale[bot] avatar May 31 '22 02:05 stale[bot]

bump

dickescheid avatar Jun 13 '22 21:06 dickescheid

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Any further update will cause the issue/pull request to no longer be considered stale. Thank you for your contributions.

stale[bot] avatar Jul 14 '22 02:07 stale[bot]

Bump

mymasse avatar Jul 14 '22 02:07 mymasse

Bump

rotemsh15 avatar Aug 01 '22 16:08 rotemsh15

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Any further update will cause the issue/pull request to no longer be considered stale. Thank you for your contributions.

stale[bot] avatar Sep 16 '22 01:09 stale[bot]

bump

dickescheid avatar Sep 16 '22 09:09 dickescheid

Bump, issue still ongoing

fpetr avatar Oct 05 '22 06:10 fpetr

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Any further update will cause the issue/pull request to no longer be considered stale. Thank you for your contributions.

stale[bot] avatar Nov 09 '22 07:11 stale[bot]

This issue is being automatically closed due to inactivity.

stale[bot] avatar Nov 27 '22 14:11 stale[bot]

bump

Mici81 avatar Dec 22 '22 14:12 Mici81

I'm also looking for a clever solution to this issue. Is this related to the fact that Grafana's Alertmanager doesn't receive alerts from Loki as described here:

https://grafana.com/docs/grafana/latest/alerting/fundamentals/alertmanager/

Can you please reopen this @mrueg?

itay-grudev avatar Feb 07 '23 19:02 itay-grudev

Is this also related to: https://github.com/grafana/grafana/issues/39219?

itay-grudev avatar Feb 07 '23 19:02 itay-grudev

Actually, after reading through https://github.com/grafana/grafana/issues/39219, this is exactly the same issue. I think it should remain closed. Refer to the closing comment on the Grafana issue for details.

itay-grudev avatar Feb 07 '23 20:02 itay-grudev