aws-node-termination-handler
Gracefully handle EC2 instance shutdown within Kubernetes
NTH does not report 0 when a metric has not been observed during runtime. Prometheus expects counters to report 0 when they are 0. If they are simply not reported,...
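The zero-reporting behavior described above could be sketched as follows. This is a minimal, standard-library-only illustration, not NTH's actual metrics code or the Prometheus client API: the `Counter` class, the `actions_total` metric name, and the `action` label values are all hypothetical. The idea is to pre-register every known label value at 0, so the series appears in the scrape output before the first event is observed.

```python
class Counter:
    """Hypothetical minimal counter; not the NTH or Prometheus client API."""

    def __init__(self, name, help_text, label_name, known_label_values=()):
        self.name = name
        self.help_text = help_text
        self.label_name = label_name
        # Pre-register every known label value at 0 so the series
        # is exported even if the event never happens.
        self.values = {value: 0 for value in known_label_values}

    def inc(self, label_value, amount=1):
        # Label values that were not pre-registered start from 0 here.
        self.values[label_value] = self.values.get(label_value, 0) + amount

    def render(self):
        # Emit the Prometheus text exposition format for this counter.
        lines = [
            f"# HELP {self.name} {self.help_text}",
            f"# TYPE {self.name} counter",
        ]
        for label_value in sorted(self.values):
            lines.append(
                f'{self.name}{{{self.label_name}="{label_value}"}} '
                f"{self.values[label_value]}"
            )
        return "\n".join(lines) + "\n"


# Assumed metric and label names, for illustration only.
actions = Counter(
    "actions_total",
    "Number of node actions taken.",
    "action",
    known_label_values=["cordon-and-drain", "uncordon"],
)
actions.inc("cordon-and-drain")
output = actions.render()
```

With this approach, `uncordon` is exported with a value of 0 even though it was never incremented, so rate and alerting queries have a series to work with from the start.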
I've been using the NTH in queue processor mode. This implementation uses a lifecycle hook associated with the node instance to trigger the NTH to cordon/drain. Lifecycle hooks support two...
When using instance refresh to update ASGs, it looks like the events come through with a start date of now, which triggers the node-termination handler to start cordoning and draining...
Clarify that the rebalance recommendation only applies to Spot instances and that it's a different event type from an AZ rebalance #416
**Describe the feature** It would be nice to add a [values.schema.json](https://helm.sh/docs/faq/changes_since_helm2/#validating-chart-values-with-jsonschema) after the [JSON schema deprecated flag](http://json-schema.org/draft/2019-09/json-schema-validation.html#rfc.section.9.3) feature becomes [supported](https://github.com/helm/helm/issues/10732) in Helm (looks like [this PR](https://github.com/helm/helm/pull/11340) needs to be merged...
**Describe the bug** Hi, in the logs right after the NTH starts, we frequently see errors like the one below ``` 2022/09/08 08:18:46 ERR Error when trying to list Nodes w/...
Missing v1.21.0 image on public ECR.
When the node-termination handler deletes the node, the Node transitions to the 'NotReady' state due to the unavailability of kubelet. Upon inspecting the logs, it appears that client-side throttling is...
**Describe the bug** The Helm chart provides a default webhook template for Slack. I tried changing the template to Discord's, but I am getting the error below in the logs:...
**Describe the bug** Lately, we have seen repeated errors in all of our environments: There was a problem while trying to cordon and drain the node,...