DeepSpeedExamples
BERT in Megatron-LM-v1.1.5-3D_parallelism does not support pipeline parallelism
I tried to run BERT with pipeline parallelism, but I got an error:
File "DeepSpeedExamples/Megatron-LM-v1.1.5-3D_parallelism/pretrain_bert.py", line 146, in
I dug into the code a little, and it seems that pipeline parallelism is not implemented for BERT.
Hi @eddy16112, thanks for your interest in 3D parallelism! At this time we have not adapted BERT to support pipeline parallelism. Only the GPT code path is supported.
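Until BERT is adapted, one way to avoid hitting the traceback deep inside the training script is to fail fast in your own launcher. The snippet below is only a sketch of that idea: `check_pipeline_support`, `SUPPORTED_PIPELINE_MODELS`, and the `pipeline_stages` parameter are hypothetical names made up for illustration and are not part of DeepSpeedExamples or Megatron-LM. It also assumes that a stage count of 1 means pipeline parallelism is disabled.

```python
# Hypothetical guard for a user-side launcher script.
# None of these names exist in DeepSpeedExamples; they are illustrative only.

# Assumption taken from the reply above: only the GPT code path is adapted.
SUPPORTED_PIPELINE_MODELS = {"gpt2"}


def check_pipeline_support(model_name: str, pipeline_stages: int) -> None:
    """Raise early if pipeline parallelism is requested for an unsupported model.

    pipeline_stages is assumed to be the number of pipeline stages,
    where 1 means pipeline parallelism is effectively off.
    """
    wants_pipeline = pipeline_stages > 1
    if wants_pipeline and model_name.lower() not in SUPPORTED_PIPELINE_MODELS:
        raise NotImplementedError(
            f"Pipeline parallelism is not implemented for '{model_name}'; "
            f"supported models: {sorted(SUPPORTED_PIPELINE_MODELS)}"
        )


if __name__ == "__main__":
    check_pipeline_support("gpt2", pipeline_stages=4)  # passes silently
    check_pipeline_support("bert", pipeline_stages=1)  # passes: no pipelining requested
    try:
        check_pipeline_support("bert", pipeline_stages=4)
    except NotImplementedError as err:
        print(err)
```

Calling a check like this before launching the job turns the failure into a clear message instead of an error from inside `pretrain_bert.py`.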