Mark pod as failed if it is in ContainerCreating state for "x" amount of time
Describe the solution you'd like [A clear and concise description of what you want to happen.] AKS ImageCleaner user request: In order to allow for cleanup and new ImageJobs to be scheduled, allow users to set a timeout for ImageJob to be marked as failed with pods stuck in ContainerCreating state for "x" amount of time.
Anything else you would like to add: [Miscellaneous information that will assist in solving the issue.]
Environment:
- Eraser version:
- Kubernetes version: (use
kubectl version):
If pod is stuck due to an issue on the node, a work around here can be to use the eraser.sh/cleanup.filter to skip that node so it doesn't run into the issue on each repeat period of ImageJob.
cc @WilliamRockwellEvans