governor
A collection of cluster reliability tools for Kubernetes
**Is this a BUG REPORT or FEATURE REQUEST?**: feature request **What happened**: Currently, when a node drain times out, node-reaper will uncordon the node and then try to drain it again...
In the case where there are multiple NATs in a zone, and that zone is cordoned, we may restore a different NAT. We should use tagging or a similar approach to...
**Is this a BUG REPORT or FEATURE REQUEST?**: BUG REPORT **What happened**: Node reaper got stuck on processing a dud node that didn't exist in AWS but was still in...
**Is this a BUG REPORT or FEATURE REQUEST?**: enhancement **What happened**: When node reaper tries to drain a node, it does not check if all pods can be disrupted and...
It has lots of options and functions, and the README does not mention any of them. The README should be improved to encourage adoption and so that users know what they are installing.
We need to refactor node reaper. It started as a simple script, but as we add more and more logic, the current structure is becoming a bit flaky....
Governor packages should have an end-to-end functional test. We can have a Travis cron job that runs this test nightly. The test should: - set up an EKS cluster -...
Unit-test coverage is around 50%; we should get it above 75% at the very least.
Pod reaper should support some sort of rule configuration to make it much more dynamic. Instead of using flags, we can load a configuration file that contains something like: ```...
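
The node-drain enhancement listed above asks that node reaper check whether all pods can be disrupted before draining. The sketch below is one way that pre-check could look. It is not governor's implementation: `Pod`, `DisruptionBudget`, `blocked_pods`, and `can_drain` are hypothetical names, and the budget type is a simplified stand-in for a Kubernetes PodDisruptionBudget that keeps only a label selector and the `status.disruptionsAllowed` count. It ignores namespaces and uses no Kubernetes client.

```python
from dataclasses import dataclass, field


@dataclass
class Pod:
    # Hypothetical minimal pod record: a name plus its labels.
    name: str
    labels: dict = field(default_factory=dict)


@dataclass
class DisruptionBudget:
    # Hypothetical, simplified stand-in for a PodDisruptionBudget.
    # `selector` is a plain label map; `disruptions_allowed` mirrors
    # the value reported in the budget's status.disruptionsAllowed.
    name: str
    selector: dict
    disruptions_allowed: int


def matches(selector, labels):
    """Return True if every selector key/value pair appears in the labels."""
    return all(labels.get(key) == value for key, value in selector.items())


def blocked_pods(pods, budgets):
    """Return the names of pods whose eviction would exceed a budget.

    Pods are considered in order; each evictable pod consumes one
    allowed disruption from every budget that selects it.
    """
    remaining = {b.name: b.disruptions_allowed for b in budgets}
    blocked = []
    for pod in pods:
        covering = [b for b in budgets if matches(b.selector, pod.labels)]
        if any(remaining[b.name] <= 0 for b in covering):
            blocked.append(pod.name)
            continue
        for b in covering:
            remaining[b.name] -= 1
    return blocked


def can_drain(pods, budgets):
    """Return True only if every pod on the node can be evicted."""
    return not blocked_pods(pods, budgets)
```

A caller would pass the pods scheduled on the candidate node and the budgets from the relevant namespaces, then skip or defer the drain when `can_drain` returns `False`, using `blocked_pods` to report which pods are holding it up.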