About DistributedDataParallel
Hi, I can see that the source code only uses non-distributed training, even when training with multiple GPUs. Is there any special reason why you chose non-distributed training instead of DistributedDataParallel?
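
For context, below is a minimal sketch of what a multi-GPU setup with PyTorch's `DistributedDataParallel` would look like. The model, dataset, and hyperparameters are placeholders for illustration, not taken from the qd-3dt codebase.

```python
# Minimal DDP sketch (illustrative; not the qd-3dt training code).
import os

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, TensorDataset
from torch.utils.data.distributed import DistributedSampler


def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE for each process.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # Placeholder model and data; the real detector/tracker would go here.
    model = torch.nn.Linear(10, 1).cuda(local_rank)
    model = DDP(model, device_ids=[local_rank])

    dataset = TensorDataset(torch.randn(256, 10), torch.randn(256, 1))
    sampler = DistributedSampler(dataset)  # shards the data across ranks
    loader = DataLoader(dataset, batch_size=32, sampler=sampler)

    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    loss_fn = torch.nn.MSELoss()

    for epoch in range(2):
        sampler.set_epoch(epoch)  # reshuffle shards each epoch
        for x, y in loader:
            x, y = x.cuda(local_rank), y.cuda(local_rank)
            optimizer.zero_grad()
            loss = loss_fn(model(x), y)
            loss.backward()  # gradients are all-reduced across GPUs here
            optimizer.step()

    dist.destroy_process_group()


if __name__ == "__main__":
    main()

# Launch with: torchrun --nproc_per_node=NUM_GPUS train_ddp.py
```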