cosmos-validator-mission-control icon indicating copy to clipboard operation
cosmos-validator-mission-control copied to clipboard

Identify tool combinations to implement the spec

Open cfl0ws opened this issue 5 years ago • 3 comments

I'm currently leaning toward a combination of Prometheus and Icinga to implement the spec. Prometheus is well-proven and familiar to the Cosmos community.

I've been looking at and experimenting with Icinga for a while. It seems to be an effective open-source monitoring and alerting tool.

Other tools mentioned in the past by community members include -

Graylog - Log monitoring

Zabbix - Network, server, cloud and application monitoring

ELK - Network operations monitoring

Next steps -

  • [x] Research a few more additional platforms

  • Cacti*

  • Graphite*

  • Icinga*

  • LibreNMS

  • OpenNMS

  • Nagios Core*

  • NetXMS

  • PandoraFMS

  • Grafana/Prometheus*

  • Sensu

  • Zabbix*

  • [x] Shortlist platforms

  • Icinga*

  • Grafana/Prometheus*

  • Zabbix*

  • [X] Choose platforms

  • Icinga

  • Grafana/Prometheus

cfl0ws avatar Jan 08 '20 18:01 cfl0ws

I've shortlisted the tools to -

  • Icinga
  • Prometheus
  • Zabbix

Next step will be to map each to the requirements docs to identify the best fit or fits, if a combination of tools may be required.

cfl0ws avatar Jan 22 '20 17:01 cfl0ws

After doing some additional research, it seems like a combination of Prometheus and Icinga is the way to go. If this combination falls short, I'll look at Zabbix again.

cfl0ws avatar Jan 24 '20 21:01 cfl0ws

It turns out that Icinga was unnecessary, as we were able to customize Grafana/Prometheus to send the alerts we needed to send, using a custom-built alerting module, which will be open-sourced with the rest of the tool.

cfl0ws avatar Apr 01 '20 14:04 cfl0ws