continuous-integration icon indicating copy to clipboard operation
continuous-integration copied to clipboard

Implement sensible timeout for long-running tasks

Open fweikert opened this issue 2 years ago • 3 comments

I had to cancel https://buildkite.com/bazel/bazel-at-head-plus-downstream/builds/2193 after 13 hours (!). The culprit was rules_nodejs (rbe_ubuntu1604): The first task timed out after 8 hours, the second one had been running for 5.5 hours when I cancelled it.

fweikert avatar Sep 30 '21 09:09 fweikert

(Assigned Yun to the wrong issue)

fweikert avatar Sep 30 '21 10:09 fweikert

I'm seeing more hanging jobs at https://buildkite.com/bazel/bazelisk-plus-incompatible-flags/builds/1024#_

It looks like the task has been finished, but the job didn't terminate.

/cc @philwo Any idea what could cause this?

meteorcloudy avatar Oct 07 '21 08:10 meteorcloudy

I'm tracking the hanging jobs issue in https://github.com/bazelbuild/continuous-integration/issues/1244 👍🏻

philwo avatar Oct 07 '21 12:10 philwo