Kai-Hsun Chen

Results 327 comments of Kai-Hsun Chen

Hi @aybidi, @andrewsykim and I will sync next week to discuss this issue. I will keep you updated.

Hi @HCharlie @aybidi, @andrewsykim and I are working on a document: https://docs.google.com/document/d/1hCJsrCFYPJLS3Zusdr8N_4Y5leWUMy4bQEbsqSQp2mw/edit?usp=sharing. It is still WIP, but feel free to comment and share your feedback.

@yx367563 great! It targets KubeRay v1.3.0, which will be out before Ray Summit.

@yx367563 you can also join our bi-weekly KubeRay community sync: https://calendar.google.com/calendar/u/0?cid=Y19iZWIwYTUxZDQyZTczMTFmZWFmYTY5YjZiOTY1NjAxMTQ3ZTEzOTAxZWE0ZGU5YzA1NjFlZWQ5OTljY2FiOWM4QGdyb3VwLmNhbGVuZGFyLmdvb2dsZS5jb20. In addition, we will track the progress of the "Streamline RayJob UX" in the...

Honestly, I don't think KubeRay should handle and expose K8s Pod errors. You can think of RayCluster as equivalent to multiple ReplicaSets. [ReplicaSetStatus](https://pkg.go.dev/k8s.io/api/apps/v1#ReplicaSetStatus) doesn't include "Pod failure" in its status....

@MadhavJivrajani Great! I will let you know when I have a doc.

I am already working on a document. I will let you know when it is ready for review.

Hi @han-steve @MadhavJivrajani, I have scheduled a meeting for the RayCluster status improvement work stream on July 10, 8:30-8:55 AM PT. You can add the following Google Calendar...

#2271 introduces a new condition, `RayClusterReady`, and we will gradually deprecate `.Status.State`. The definition is:

* `RayClusterReady` indicates whether all Ray Pods are ready when the RayCluster is first created....