foreman-infra icon indicating copy to clipboard operation
foreman-infra copied to clipboard

Set up a Monitoring System

Open ehelms opened this issue 7 years ago • 6 comments

Our infrastructure nodes need monitoring. We don't want jobs failing because they Jenkins nodes are running out of disk space. Also notes that are down need to be checked.

ehelms avatar Feb 21 '18 02:02 ehelms

Since I do not recall the context of this I am closing.

ehelms avatar Nov 03 '21 16:11 ehelms

IMHO this is still needed so I've provided context.

ekohl avatar Nov 03 '21 16:11 ekohl

IMHO this is still needed so I've provided context.

Did you forget to hit send on the comment with the context?

ehelms avatar Nov 03 '21 16:11 ehelms

I modified the original message (from no message to some).

ekohl avatar Nov 03 '21 16:11 ekohl

Discussed during the infra sig: https://community.theforeman.org/t/infrastructure-sig-meeting-notes-2022-01-14/26860#set-up-a-monitoring-systemhttpsgithubcomtheforemanforeman-infraissues471-17

There was a discussion about which system to use. Zabbix was mentioned by @Odilhao while @ekohl mentioned Icinga. The former has the benefit that Odilon has experience with it while the latter has a foreman_monitoring plugin (though a Zabbix backend could be written for smart_proxy_monitoring. It would also run on Netways infrastructure and Netways maintains Icinga.

Agreed on the requirements:

  • It needs to implement disk checks since this is what we mostly run into. While every monitoring solution implements this, it should be seen as the focus point to start with.
  • Puppetized since Foreman’s infrastructure is maintained with Puppet and we don’t want to manually deploy anything.

ekohl avatar Jan 14 '22 17:01 ekohl

https://github.com/Icinga/icinga-packaging/issues/283 is blocking us from using Icinga 2.

ekohl avatar Sep 28 '23 12:09 ekohl