k8s-pod-restart-info-collector icon indicating copy to clipboard operation
k8s-pod-restart-info-collector copied to clipboard

feat: Ignore specific errors for given pods

Open nikup opened this issue 7 months ago • 2 comments

Add support for ignoring a specific restart error for a specific type of pod. Example: We have a gke-metrics-agent that sometimes can't start because of Failed to run the service: failed to start extensions: listen tcp 127.0.0.1:8203: bind: address already in use. With the current implementation we can ignore that by adding

ignoredErrorsForPodNamePrefixes = "{ "gke-metrics-agent": [ "address already in use" ]}"

nikup avatar Nov 13 '23 12:11 nikup

@able8 any chance of getting this in?

ventsislav-georgiev avatar Dec 01 '23 18:12 ventsislav-georgiev

Hi @nikup @ventsislav-georgiev, thank you for contributing. I left 2 comments. Also, the Contributor License Agreement (CLA) is required to merge PR. Refer to https://github.com/airwallex/k8s-pod-restart-info-collector/blob/master/CONTRIBUTION.md

able8 avatar Dec 06 '23 05:12 able8