alertmanager icon indicating copy to clipboard operation
alertmanager copied to clipboard

Alarm repeat even if set repeat_interval='4h'

Open jiaxinyang1 opened this issue 3 years ago • 4 comments

What did you do? that's my alertmanager configure:

route:
  group_by: [ 'msf_cluster', 'alertname', 'msf_service' ]
  group_wait: 1m
  group_interval: 5m
  repeat_interval: 4h
  receiver: msfmanagement-notification
receivers:
- name: msfmanagement-notification
  webhook_configs:
    - url: http://msfmanagement-notification.msfmanagement.svc.cluster.local:80/v1/alerts
inhibit_rules:
- source_match:
    eventLevel: "ERROR"
  target_match:
    eventLevel: "NOTICE"
  equal: ['msf_cluster', 'alertname', 'msf_service']

this is one of my rule:

    - alert: high_cpu_error
      expr: sum (msf_pod_cpu_usage_ms{msf_service_pod_container!="register"} / (msf_pod_cpu_limit_ms{msf_service_pod_container!="register"}>0) ) by (msf_cluster,msf_service,msf_service_pod,msf_service_pod_container)*100 > 95
      for: 3m
      labels:
        eventType: "HIGH_CPU"
        eventLevel: "ERROR"
        producer: "Alertmanager"
      annotations:
        summary: "Service high cpu usage"
        description: {{`"Service {{$labels.msf_service}} instance {{ $labels.msf_service_pod }} container {{ $labels.msf_service_pod_container }} cpu usage more than 95%"`}}

What did you expect to see?

I want to get repeating alarm every 4 hours。 What did you see instead? Under which circumstances? But i get repeating alarm in a short time


Service apm-db-bridge-system-cluster01 instance system-apm-apm-db-bridge-ddd568847-g2mkq container service cpu usage more than 70%      2022-03-21 08:05:35.574153+00

Service apm-db-bridge-system-cluster01 instance system-apm-apm-db-bridge-ddd568847-g2mkq container service cpu usage more than 70%      2022-03-21 08:00:35.572802+00

The interval is almost a group_interval

Environment

  • System information:

    insert output of uname -srm here

  • Alertmanager version:

alertmanager, version 0.20.0 (branch: HEAD, revision: f74be0400a6243d10bb53812d6fa408ad71ff32d)
  build user:       root@00c3106655f8
  build date:       20191211-14:13:14
  go version:       go1.13.5
  • Prometheus version:
prometheus, version 2.18.2 (branch: HEAD, revision: a6600f564e3c483cc820bae6c7a551db701a22b3)
  build user:       root@130a411dd4ff
  build date:       20200609-09:05:58
  go version:       go1.14.4
  • Alertmanager configuration file:
insert configuration here
  • Prometheus configuration file:
insert configuration here (if relevant to the issue)
  • Logs:
insert Prometheus and Alertmanager logs relevant to the issue here

jiaxinyang1 avatar Mar 21 '22 08:03 jiaxinyang1

HI @jiaxinyang1, thanks for reporting. It is not clear to me if the labels of the group are all are the same.

You've defined your group to be: group_by: [ 'msf_cluster', 'alertname', 'msf_service' ], but in your message:

Service apm-db-bridge-system-cluster01 instance system-apm-apm-db-bridge-ddd568847-g2mkq container service cpu usage more than 70%

I'm not able to tell, which is which? It's important to note that if any of the labels in the group by clause is different, it is treated as a separate group and will trigger the cycle of wait -> interval -> repeat again.

gotjosh avatar Mar 21 '22 11:03 gotjosh

thanks. this is my query result

{msf_cluster="cluster01",msf_service="apm-rest-bridge-apmstage-cluster01",msf_service_pod="apm-apm-rest-bridge-68d9bf5f7b-h8mhm",msf_service_pod_container="service"} | 6.908789499999999

Service apm-db-bridge-system-cluster01 instance system-apm-apm-db-bridge-ddd568847-g2mkq container service cpu usage more than 70%

msf_cluster="cluster01" All metrics for this label are unique values msf_service="apm-db-bridge-system-cluster01"

i don't set label 'msf_service_pod' and label 'msf_service_pod_container' in my group_by .will the same tag content be divided into different groups?

jiaxinyang1 avatar Mar 22 '22 00:03 jiaxinyang1

Are you running a single Alertmanager instance or several? Have you enabled the debug log level and checked the logs? If not, please do so and share the logs if you can't find anything.

simonpasquier avatar Mar 24 '22 13:03 simonpasquier

thank you . @simonpasquier

yes. i'am running a single alertmanager .only one Endpoint address in my prometheus Runtime Information i enabled the debug log . i change the alertmanager config reduced repeat_interval to test .

global:
  resolve_timeout: 5m
  http_config: {}
  smtp_hello: localhost
  smtp_require_tls: true
  pagerduty_url: https://events.pagerduty.com/v2/enqueue
  hipchat_api_url: https://api.hipchat.com/
  opsgenie_api_url: https://api.opsgenie.com/
  wechat_api_url: https://qyapi.weixin.qq.com/cgi-bin/
  victorops_api_url: https://alert.victorops.com/integrations/generic/20131114/alert/
route:
  receiver: msfmanagement-notification
  group_by:
  - msf_cluster
  - alertname
  - msf_service
  group_wait: 1m
  group_interval: 5m
  repeat_interval: 20m
inhibit_rules:
- source_match:
    eventLevel: ERROR
  target_match:
    eventLevel: NOTICE
  equal:
  - msf_cluster
  - alertname
  - msf_service
receivers:
- name: msfmanagement-notification
  webhook_configs:
  - send_resolved: false
    http_config: {}
    url: http://msfmanagement-notification.msfmanagement.svc.cluster.local:80/v1/alerts
templates: []

I choose a rule to observe

alert: high_mem_critical
expr: sum
  by(msf_cluster, msf_service, msf_service_pod, msf_service_pod_container) (msf_pod_memory_usage_Mb{msf_service_pod_container!="register"}
  / (msf_pod_memory_limit_Mb{msf_service_pod_container!="register"} > 0))
  * 100 > 95
for: 3m
labels:
  eventLevel: CRITICAL
  eventType: HIGH_MEM
  producer: Alertmanager
annotations:
  description: Service {{$labels.msf_service}} instance {{ $labels.msf_service_pod
    }} container {{ $labels.msf_service_pod_container }} memory usage more than 95%
    ,values is {{$value}}
  summary: Service high mem usage

this is the alertmanager log

level=debug ts=2022-03-25T02:23:21.777Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:24:21.777Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T02:25:21.776Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:27:21.788Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:29:21.778Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T02:29:21.782Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T02:34:21.778Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T02:34:21.783Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:36:21.780Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:37:21.772Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T02:39:21.779Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T02:41:21.786Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:42:21.786Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T02:43:21.777Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:44:21.777Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T02:47:21.787Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T02:51:21.784Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:52:21.785Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T02:53:21.783Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T02:55:21.778Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T02:57:21.785Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T02:59:21.783Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:00:21.784Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T03:01:21.784Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T03:05:21.781Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:05:21.785Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T03:07:21.788Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:09:21.788Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:10:21.785Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T03:11:21.788Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T03:15:21.781Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:15:21.786Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T03:16:21.785Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T03:20:21.785Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:20:21.787Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T03:21:21.787Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T03:25:21.787Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T03:26:21.803Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:27:21.804Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T03:28:21.790Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:30:21.839Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T03:32:21.804Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T03:34:23.703Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:35:21.801Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T03:35:21.801Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T03:44:21.801Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:45:21.802Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T03:46:21.802Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:48:21.798Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T03:49:21.802Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T03:50:21.802Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T04:06:21.765Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:07:21.767Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T04:08:21.764Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:10:21.773Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:12:21.765Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:12:21.768Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T04:13:21.757Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T04:15:21.785Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T04:17:21.768Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T04:17:21.769Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T04:22:21.769Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T04:32:21.784Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:33:21.785Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T04:34:21.776Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:36:21.778Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T04:38:21.785Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T04:40:21.783Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:41:21.784Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T04:42:21.786Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:44:21.773Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T04:46:21.784Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T04:49:21.775Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:50:21.776Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T04:51:21.774Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:53:21.783Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T04:55:21.776Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T04:57:21.795Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T04:58:21.796Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T04:59:21.788Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:01:21.779Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:03:21.783Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:03:21.796Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:05:21.789Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:07:21.785Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:08:21.796Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:09:21.792Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T05:13:21.797Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T05:14:21.782Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:15:21.782Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:16:21.791Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:18:21.785Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:20:21.783Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:20:21.783Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:22:21.798Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:24:21.787Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:25:21.783Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:26:21.791Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:28:21.794Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:30:21.784Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:30:21.785Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:32:23.711Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:34:21.790Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:35:21.784Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:36:21.794Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:38:21.796Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:40:21.785Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:40:21.802Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:42:21.807Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:44:21.792Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T05:45:21.786Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T05:54:21.812Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:55:21.813Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T05:56:21.804Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T05:58:21.804Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:00:21.813Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T06:00:21.818Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:02:21.798Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:04:21.792Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:05:21.814Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T06:06:21.765Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T06:10:21.769Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:10:21.814Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T06:12:21.772Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:14:21.781Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:15:21.815Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T06:16:27.909Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:18:21.768Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:20:21.778Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T06:20:21.815Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T06:24:21.769Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:25:21.770Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T06:25:21.771Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T06:30:21.770Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]
level=debug ts=2022-03-25T06:30:21.773Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:32:21.775Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:34:21.768Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][active]
level=debug ts=2022-03-25T06:35:21.770Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][active]]
level=debug ts=2022-03-25T06:35:23.804Z caller=dispatch.go:135 component=dispatcher msg="Received alert" alert=high_mem_critical[05d39fd][resolved]
level=debug ts=2022-03-25T06:40:21.772Z caller=dispatch.go:465 component=dispatcher aggrGroup="{}:{alertname=\"high_mem_critical\", msf_cluster=\"cluster01\", msf_service=\"apm-mes-system-cluster01\"}" msg=flushing alerts=[high_mem_critical[05d39fd][resolved]]

I get two alerts at those time:

"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 02:24:07.820329+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 02:42:07.838205+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 02:52:07.8077+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 03:00:07.810456+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 03:27:07.813749+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 03:45:07.811464+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 04:07:07.7669+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 04:33:07.772855+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 04:50:07.74466+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 04:58:07.763306+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 04:41:07.799701+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 05:15:07.759754+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 05:40:07.731338+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 05:55:07.749747+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 06:25:07.691202+00"
"apm-mes-system-cluster01"	"{instance=system-apm-apm-mes-75fffc9dc5-fsqtm,msf_service_pod_container=service}"	"2022-03-25 06:35:07.692441+00"

jiaxinyang1 avatar Mar 25 '22 06:03 jiaxinyang1