
Any plans to incorporate tensor parallelism or ZeRO data parallelism?

Open GeneZC opened this issue 2 years ago • 2 comments

Would it be possible for this framework to combine its pipeline parallelism with tensor parallelism or ZeRO data parallelism?

GeneZC avatar Sep 14 '23 02:09 GeneZC

Hi,

Thanks for your interest in this repo!

Are there any posts with experiments showing that adding tensor parallelism or ZeRO would improve training performance?

CoinCheung avatar Sep 18 '23 06:09 CoinCheung

Not really.

However, there are projects, such as Megatron, that combine pipeline parallelism with tensor parallelism for efficiency. I believe this project offers a better solution, since it depends only on DeepSpeed and avoids Megatron's heavy dependencies.

As for pipeline parallelism with ZeRO, I have not seen any other project do this.
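
To make the pipeline-plus-tensor combination concrete, here is a minimal sketch of how global ranks could be laid out on a pipeline x data x tensor grid. It is not code from this repo, DeepSpeed, or Megatron; the names `Coord`, `rank_to_coord`, and `tensor_group` are hypothetical, and the ordering (tensor ranks innermost, so they sit on adjacent devices) is an assumption chosen for illustration.

```python
from typing import List, NamedTuple


class Coord(NamedTuple):
    """Position of one process on the (pipeline, data, tensor) grid."""
    pipe: int
    data: int
    tensor: int


def rank_to_coord(rank: int, pipe_size: int, data_size: int, tensor_size: int) -> Coord:
    """Map a global rank to grid coordinates.

    Assumed layout (illustrative only): the tensor axis varies fastest,
    then the data axis, then the pipeline axis.
    """
    world_size = pipe_size * data_size * tensor_size
    if not 0 <= rank < world_size:
        raise ValueError(f"rank {rank} is outside world size {world_size}")
    tensor = rank % tensor_size
    data = (rank // tensor_size) % data_size
    pipe = rank // (tensor_size * data_size)
    return Coord(pipe, data, tensor)


def tensor_group(rank: int, tensor_size: int) -> List[int]:
    """Ranks that share this rank's pipeline stage and data replica."""
    base = rank - (rank % tensor_size)
    return list(range(base, base + tensor_size))


if __name__ == "__main__":
    # 8 processes: 2 pipeline stages x 2 data replicas x 2 tensor shards.
    for r in range(8):
        print(r, rank_to_coord(r, pipe_size=2, data_size=2, tensor_size=2))
```

With this layout the world size must equal the product of the three sizes, which is why adding tensor parallelism on top of a pipeline multiplies the number of devices needed per model replica.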

GeneZC avatar Sep 19 '23 11:09 GeneZC