
Investigate whether we can use "reconcileTasks" method rather than custom healthcheck message

Open philwinder opened this issue 10 years ago • 3 comments

The healthcheck sends a custom framework message to request a status update. We might be able to use the Mesos method reconcileTasks() instead.
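A minimal sketch of the idea, not the framework's actual code: the scheduler records when it last heard a status update for each task and, once a task goes quiet for too long, asks the driver to reconcile it rather than sending a custom framework message. The class ReconcileSketch, the ReconcileDriver interface, and the timeout are hypothetical names invented for illustration. In Mesos itself the corresponding call is SchedulerDriver.reconcileTasks(Collection<TaskStatus>), and the answers come back through the scheduler's statusUpdate callback; plain task ID strings are used here only to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class ReconcileSketch {
    /**
     * Hypothetical stand-in for the single driver call used here.
     * The real Mesos method takes a Collection<TaskStatus>, not task ID strings.
     */
    interface ReconcileDriver {
        void reconcileTasks(Collection<String> taskIds);
    }

    // Last time (in milliseconds) a status update was seen for each task.
    private final Map<String, Long> lastUpdateMillis = new HashMap<>();
    private final long timeoutMillis;

    ReconcileSketch(long timeoutMillis) {
        this.timeoutMillis = timeoutMillis;
    }

    /** Call this from the scheduler's status update callback. */
    void onStatusUpdate(String taskId, long nowMillis) {
        lastUpdateMillis.put(taskId, nowMillis);
    }

    /** Returns the tasks whose last update is older than the timeout, sorted by ID. */
    List<String> staleTasks(long nowMillis) {
        List<String> stale = new ArrayList<>();
        for (Map.Entry<String, Long> entry : lastUpdateMillis.entrySet()) {
            if (nowMillis - entry.getValue() > timeoutMillis) {
                stale.add(entry.getKey());
            }
        }
        Collections.sort(stale);
        return stale;
    }

    /** Periodic healthcheck: request reconciliation only for tasks that went quiet. */
    void check(ReconcileDriver driver, long nowMillis) {
        List<String> stale = staleTasks(nowMillis);
        if (!stale.isEmpty()) {
            driver.reconcileTasks(stale);
        }
    }
}
```

With this shape the status answer comes from Mesos rather than from the executor, so a lost executor would still be reported, assuming reconciliation behaves as documented.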

Aug 06 '15 14:08 philwinder

Wait for the refactoring in #327.

Sep 29 '15 09:09 jhftrifork

FWIW, we experienced an issue where an executor task was lost while the scheduler was also down, and we were unable to recover gracefully. I assume this is because the executor healthcheck is done by the scheduler directly and not by Mesos.

Apr 15 '16 22:04 justinclayton

@justinclayton Thanks, Justin. We've just had a similar issue reported in #550. We can't decide whether this is our responsibility, i.e. whether we should add more code to work around the fact that Mesos can't handle this failure.

Apr 18 '16 07:04 philwinder