lifecycle-manager
Graceful AWS scaling event on Kubernetes using lifecycle hooks
**FEATURE REQUEST**: **What happened**: Draining & removing nodes from ASG also requires us to update the monitoring systems to stop monitoring the node and a few other things before the...
**Is this a BUG REPORT or FEATURE REQUEST?**: May be a bug, or some workaround may be needed. **What happened**: Not working during spot instance termination. Also checked this: https://github.com/keikoproj/lifecycle-manager/issues/18#issuecomment-547652760. Is it...
It was noticed that when a huge spike of terminating instances happens (150 instances, then another 150 instances after a few minutes), we may not have enough goroutines or do not prioritize...
Related #35. Currently we limit heartbeat extensions to at most 1 hour. In some use cases this might need to be configurable. We should introduce a new flag `--processing-timeout-seconds` to take in max...
In very large clusters with many target groups, we may want to 'batch' terminations so that we do not work on more than N instances at the same time. When...
It was seen on clusters with massive numbers of ALBs/Target Groups (300-400 ALBs + 400-500 Target Groups) that things start to break down - once the controller starts getting throttled very heavily,...
- Should add a new metric called `received_events_total`.
- It seems that abandoned instances due to timeout do not count as a failed event.
Some metrics published under `/metrics` should have labels that carry metadata such as the autoscaling group name. We should decide which labels are relevant and annotate the corresponding metrics.
We should have a basic functional test that verifies the following statements:
- can receive a lifecycle hook and drain a node
- can receive a lifecycle hook and deregister...
Bumps [github.com/aws/aws-sdk-go](https://github.com/aws/aws-sdk-go) from 1.48.11 to 1.51.12.
Release notes, sourced from github.com/aws/aws-sdk-go's releases:
Release v1.51.12 (2024-04-01), Service Client Updates:
- service/datazone: Updates service API, documentation, and paginators
- service/deadline: Updates service API, documentation,...