Andrew Sy Kim
Andrew Sy Kim
@kevin85421 @rueian do you know if this bug overlaps with any of the RayService refactorign we're doing now?
I've encountered similar issues in the past from my testing, see https://github.com/ray-project/kuberay/issues/2355 We also increased the exec probe timeout for Head pod to 5s, so I am also open to...
Longer term, we really need to remove dependency to exec probes, I believe that once we are using HTTP probes, we can use shorter timeouts with significantly better reliability. There's...