Question about "timed out waiting for evictions to complete: timed out"

Open vukor opened this issue 6 years ago • 5 comments

I periodically see this in the logs:

INFO kubernetes/eventhandler.go:155 Failed to drain {"node": "ip-x-x-x-x.ec2.internal", "error": "timed out waiting for evictions to complete: timed out", "errorVerbose": "timed out\ntimed out waiting for evictions to complete\ngithub.com/planetlabs/draino/internal/kubernetes.(*APICordonDrainer).Drain\n\t/go/src/github.com/planetlabs/draino/internal/kubernetes/drainer.go:189\ngithub.com/planetlabs/draino/internal/kubernetes.(*DrainingResourceEventHandler).cordonAndDrain.func1\n\t/go/src/github.com/planetlabs/draino/internal/kubernetes/eventhandler.go:154\nruntime.goexit\n\t/usr/local/go/src/runtime/asm_amd64.s:1357"}

It looks like draino ignores the node after the drain times out, but I'm not sure. As I understand it, draino is a stateless app, so will it try to drain this node again?

vukor avatar Nov 08 '19 08:11 vukor

^ +1

ch-andremercer avatar Jun 03 '20 22:06 ch-andremercer

Did you find a solution, maybe on the configuration side? @vukor

tarunptala avatar Feb 11 '21 03:02 tarunptala

Same error here... did you find a solution, @vukor?

yogeek avatar Apr 15 '21 20:04 yogeek

Same error here... did you find a solution, @vukor?

Nope, I periodically see the error messages in planetlabs/draino:b788331.

vukor avatar Apr 16 '21 08:04 vukor

Adding these extraArgs seems to have helped:

extraArgs:
  - evict-daemonset-pods
  - evict-emptydir-pods
  - evict-unreplicated-pods

I had to adjust the deployment template to get these parsed correctly by Helm:

          {{- range $key, $value := .Values.extraArgs }}
            - {{ if $value }}--{{ $value }}{{ end }}
          {{- end }}

willshu avatar Jun 14 '21 22:06 willshu
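
For reference, here is a sketch of what the range loop in the template above should render for the three extraArgs values listed in that comment. Each non-empty value gets a `--` prefix and becomes one list item. The enclosing `args:` key and the indentation are assumptions, since the rest of the chart's deployment template is not shown in this thread.

```yaml
# Assumed excerpt of the rendered container spec; only the three list
# items follow directly from the extraArgs values and the template loop.
args:
  - --evict-daemonset-pods
  - --evict-emptydir-pods
  - --evict-unreplicated-pods
```

Running `helm template` on the chart and checking the container arguments in the output is a quick way to confirm the flags were rendered as expected.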