container-linux-update-operator

operator: expose metrics

Open lucab opened this issue 8 years ago • 0 comments

update-operator is a long-running Go process which supervises complex cluster-wide operations. As such, it should expose metrics about its status, which can be scraped by Prometheus and alerted upon. Access to this endpoint should be governed by Kubernetes RBAC policies.

This is a preliminary list of interesting metrics:

  • Go runtime stats
  • nodes being managed by CLUO
  • nodes in reboot-needed state
  • nodes in before-reboot state
  • nodes in after-reboot state
  • state of the optional "before" and "after" checks

lucab, Nov 10 '17 15:11