DeepSpeedExamples icon indicating copy to clipboard operation
DeepSpeedExamples copied to clipboard

Is there any example with recent version of Megatron-LM?

Open cryoco opened this issue 3 years ago • 2 comments
trafficstars

The examples showed here or here is based on versions about half a year ago. Is there any examples aligned with recent Megatron? Or, is there still relatively obvious optimization with deepspeed to Megatron with pipeline parallelization now?

cryoco avatar Feb 15 '22 12:02 cryoco

Hi but the second page says that it is updated on March 21, 2022. I am new to Deepspeed and learning it. I don't know whether it is true.

zhaowei-wang-nlp avatar Mar 22 '22 11:03 zhaowei-wang-nlp