[Feature request] Multi-node training

Open yjzhong111 opened this issue 1 year ago • 2 comments

Hi, I have two questions:

  1. Can it be used in multi-node training?
  2. When will Trainer support DeepSpeed? I noticed that DeepSpeed integration is on the to-do list, but is there a planned date or schedule?

Thank you!

yjzhong111 avatar Dec 18 '23 06:12 yjzhong111

  1. It should work for multi-node training.
  2. No timeline for DeepSpeed. Why do you need DeepSpeed for training?

erogol avatar Dec 18 '23 07:12 erogol

Thanks! But how do I train on multiple nodes? Are there any instructions for it? As for DeepSpeed, I may use some of its tricks to improve training performance.

yjzhong111 avatar Dec 18 '23 08:12 yjzhong111
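
For readers with the same question, here is a minimal sketch of how a multi-node job is commonly launched with PyTorch's generic `torchrun` launcher. It is not taken from the Trainer documentation: whether a Trainer-based script runs unchanged under `torchrun` is an assumption, and the script name `train.py`, the address `10.0.0.1`, the port, and the helper `build_torchrun_command` are all hypothetical placeholders. The helper only assembles the command that each node would run.

```python
import shlex
from typing import List, Optional


def build_torchrun_command(
    nnodes: int,
    node_rank: int,
    nproc_per_node: int,
    master_addr: str,
    master_port: int,
    script: str,
    script_args: Optional[List[str]] = None,
) -> List[str]:
    """Assemble the torchrun command for one node of a multi-node job.

    Hypothetical helper for illustration; it does not start any process.
    """
    if not 0 <= node_rank < nnodes:
        raise ValueError(f"node_rank must be in [0, {nnodes}), got {node_rank}")

    cmd = [
        "torchrun",
        f"--nnodes={nnodes}",                  # total number of machines
        f"--node_rank={node_rank}",            # index of this machine (0 = master)
        f"--nproc_per_node={nproc_per_node}",  # usually one process per GPU
        f"--master_addr={master_addr}",        # address reachable from every node
        f"--master_port={master_port}",        # free port on the master node
        script,
    ]
    cmd.extend(script_args or [])
    return cmd


if __name__ == "__main__":
    # Print the command to run on each of two nodes with 8 GPUs each.
    # "train.py" and "10.0.0.1" are placeholders, not real project values.
    for rank in range(2):
        print(shlex.join(build_torchrun_command(2, rank, 8, "10.0.0.1", 29500, "train.py")))
```

Every node runs the same command except for `--node_rank`, and all nodes must be able to reach the master address and port. `torchrun` then sets `RANK`, `LOCAL_RANK`, and `WORLD_SIZE` in each process's environment, which is what distributed training code normally reads.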