eraser icon indicating copy to clipboard operation
eraser copied to clipboard

Mark pod as failed if it is in ContainerCreating state for "x" amount of time

Open ashnamehrotra opened this issue 2 years ago • 2 comments

Describe the solution you'd like [A clear and concise description of what you want to happen.] AKS ImageCleaner user request: In order to allow for cleanup and new ImageJobs to be scheduled, allow users to set a timeout for ImageJob to be marked as failed with pods stuck in ContainerCreating state for "x" amount of time.

Anything else you would like to add: [Miscellaneous information that will assist in solving the issue.]

Environment:

  • Eraser version:
  • Kubernetes version: (use kubectl version):

ashnamehrotra avatar Nov 13 '23 21:11 ashnamehrotra

If pod is stuck due to an issue on the node, a work around here can be to use the eraser.sh/cleanup.filter to skip that node so it doesn't run into the issue on each repeat period of ImageJob.

ashnamehrotra avatar Nov 15 '23 21:11 ashnamehrotra

cc @WilliamRockwellEvans

ashnamehrotra avatar Nov 15 '23 21:11 ashnamehrotra