yjzhong111

Results 1 comments of yjzhong111

> 1. It should work for multi-node training. > 2. No timeline for deepspeed. why do you need deepspeed for training? Thanks! But how can I train in multi-node, is...