Kai-Hsun Chen
@PKU-Fgx, maybe you can profile Ray core worker processes and compare different memory dumps to see how much memory they contribute.
@wzq016, you profiled the GCS process. You can also profile the core worker processes.
@PeterSH6, would you mind adding a "ray" label to this issue? I am triaging Ray-related issues in veRL. Thanks!
Can you use `submitterPodTemplate` instead?
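A minimal sketch of what setting `submitterPodTemplate` on a RayJob could look like, written as a Python dict for illustration. The job name, entrypoint, container name, and image tag below are placeholder assumptions, not values from this thread, and the helper `with_submitter_template` is hypothetical.

```python
import copy


def with_submitter_template(rayjob, pod_template):
    """Return a copy of a RayJob manifest with spec.submitterPodTemplate set."""
    updated = copy.deepcopy(rayjob)  # leave the caller's manifest untouched
    updated.setdefault("spec", {})["submitterPodTemplate"] = pod_template
    return updated


# Placeholder RayJob manifest (name and entrypoint are assumptions).
rayjob = {
    "apiVersion": "ray.io/v1",
    "kind": "RayJob",
    "metadata": {"name": "example-rayjob"},
    "spec": {"entrypoint": "python my_script.py"},
}

# Placeholder pod template for the submitter pod (container name and image are assumptions).
submitter_template = {
    "spec": {
        "restartPolicy": "Never",
        "containers": [
            {"name": "ray-job-submitter", "image": "rayproject/ray:2.9.0"}
        ],
    }
}

patched = with_submitter_template(rayjob, submitter_template)
```

The resulting dict can be serialized to YAML or JSON and applied like any other manifest; check the field names against the RayJob CRD of the KubeRay version you run.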
RayJob has improved a lot in KubeRay v1.1.0, so I’m not surprised that there are some stability issues in v1.0.0. However, I am surprised that KubeRay v1.1.1 also has the...
@ryanaoleary is this issue ready to close?
Thanks, @lixmgl! Some tips to make the integration possible:
* We aim to maintain KubeRay's lightweight nature. Therefore, please minimize the addition of YuniKorn-specific code to the KubeRay core.
*...
The RayCluster CRD can be considered a set of Kubernetes ReplicaSets (with each head or worker group similar to a ReplicaSet). Therefore, we aimed to make the observability consistent with...
Do you use KubeRay? In KubeRay, you can directly edit the min / max replicas of worker groups.
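As a rough sketch of that edit, the snippet below builds a JSON Patch that sets `minReplicas` and `maxReplicas` on one worker group of a RayCluster. The helper name `replica_patch` and the example values are assumptions for illustration; the paths assume the worker groups live under `spec.workerGroupSpecs`.

```python
import json


def replica_patch(group_index, min_replicas, max_replicas):
    """Build JSON Patch operations for one worker group's replica bounds."""
    if min_replicas < 0 or max_replicas < min_replicas:
        raise ValueError("need 0 <= min_replicas <= max_replicas")
    base = f"/spec/workerGroupSpecs/{group_index}"
    # In JSON Patch (RFC 6902), "add" on an existing object member replaces its value,
    # so this works whether or not the field is already set.
    return [
        {"op": "add", "path": f"{base}/minReplicas", "value": min_replicas},
        {"op": "add", "path": f"{base}/maxReplicas", "value": max_replicas},
    ]


# Example values (assumptions): first worker group, between 1 and 5 replicas.
patch = replica_patch(0, 1, 5)
patch_json = json.dumps(patch)
print(patch_json)
```

The printed JSON could then be passed to something like `kubectl patch raycluster <name> --type json -p '<patch>'`, or the same fields can simply be changed with `kubectl edit`.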
I would recommend using KubeRay instead if you are able to launch a K8s cluster.