Le Duc Manh comments

Results 12 comments of


                                            Le Duc Manh

[Feature][RayService] Handle serve deployment delete during the cluster destroy.

Is there any updates regarding this feature?

[Feature][RayService] Handle serve deployment delete during the cluster destroy.

Hi @kevin85421. We are considering using KubeRay and Ray Serve for our production model servers. We want to have async feature. We plan to utilize FastAPI backgrounds tasks for running...

[Feature][RayService] Handle serve deployment delete during the cluster destroy.

I'm willing to try providing the PR for the fix as well. But I'm gonna need some helps to start with how and where to fix.

[Feature][RayService] Handle serve deployment delete during the cluster destroy.

> Do you mean: the user sends a request → a Ray Serve replica triggers a heavy workload → it returns a response without waiting for the heavy workload to...

[Feature][RayService] Handle serve deployment delete during the cluster destroy.

I setup a long running endpoint (sleep for 5 minutes) and can see that the request got hang up during cluster rotation. It seems that regular requests are not drained...

[Feature][RayService] Handle serve deployment delete during the cluster destroy.

After taking a look at the code, it seems the current logic is delete the old cluster after 60 seconds wait ([ref](https://github.com/ray-project/kuberay/blob/master/ray-operator/controllers/ray/rayservice_controller.go#L565)) 1 possible fix I could think of is...