Jiao

Results 15 comments of Jiao

Which version of Ray you're using in your remote cluster ? Can you check by ```python import ray print(ray.__version__) print(ray.__commit__) ```

@zhanyuanucb ah sorry for the late reply as i was on paternity leave. My recommendation from the perspective of Ray maintainer -- just don't use Ray client. We don't recommend...

It is still quite necessary in my opinion, since our OPT-175B benchmarks are executed in Nvidia's Selene with NVLink and NVSwitch you mentioned, pipeline parallelism is still essential for scaling,...

@GHGmc2 thanks for the clarification -- this image is much clearer, previously i thought you meant 4th gen NVLink. I think it will change the optimal hyper parameters of (data...

Hi @EricSteinberger, thanks for providing context for the issue. You're testing a workload that we think is critical but haven't invested much yet, therefore some edges could be rough. I...