foreman-infra
Set up a Monitoring System
Our infrastructure nodes need monitoring. We don't want jobs failing because the Jenkins nodes are running out of disk space. Nodes that are down also need to be checked.
Since I do not recall the context of this, I am closing.
IMHO this is still needed so I've provided context.
Did you forget to hit send on the comment with the context?
I modified the original message (from no message to some).
Discussed during the infra sig: https://community.theforeman.org/t/infrastructure-sig-meeting-notes-2022-01-14/26860#set-up-a-monitoring-systemhttpsgithubcomtheforemanforeman-infraissues471-17
There was a discussion about which system to use. Zabbix was mentioned by @Odilhao while @ekohl mentioned Icinga. The former has the benefit that Odilon has experience with it, while the latter has a foreman_monitoring plugin (though a Zabbix backend could be written for smart_proxy_monitoring). It would also run on Netways infrastructure, and Netways maintains Icinga.
Agreed on the requirements:
- It needs to implement disk checks since this is what we mostly run into. While every monitoring solution implements this, it should be seen as the focus point to start with.
- It needs to be Puppetized, since Foreman’s infrastructure is maintained with Puppet and we don’t want to manually deploy anything.
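To make the disk-check requirement concrete, here is a minimal sketch of what such a check could look like. It is not what Icinga or Zabbix actually ship (both come with their own disk checks), and the function names and the 80%/90% thresholds are assumptions picked for illustration. The return codes follow the Nagios-compatible plugin convention (0 = OK, 1 = WARNING, 2 = CRITICAL) that Icinga understands.

```python
import shutil

# Nagios-compatible plugin status codes.
OK, WARNING, CRITICAL = 0, 1, 2


def classify(used_percent, warn=80.0, crit=90.0):
    """Map a usage percentage to a status code.

    The default thresholds are illustrative assumptions, not agreed values.
    """
    if used_percent >= crit:
        return CRITICAL
    if used_percent >= warn:
        return WARNING
    return OK


def check_disk(path="/", warn=80.0, crit=90.0):
    """Return (status, message) for the filesystem containing `path`."""
    usage = shutil.disk_usage(path)
    used_percent = 100.0 * usage.used / usage.total
    status = classify(used_percent, warn, crit)
    label = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}[status]
    return status, f"DISK {label} - {path} is {used_percent:.1f}% full"
```

A plugin wrapper would print the message and exit with the status code, so the monitoring system can pick up the result. In a Puppetized setup, the script and its thresholds would be deployed through Puppet rather than by hand.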
https://github.com/Icinga/icinga-packaging/issues/283 is blocking us from using Icinga 2.