Pretrained-Language-Model icon indicating copy to clipboard operation
Pretrained-Language-Model copied to clipboard

Why only teacher_model is applied DistributedDataParallel in general_distill.py ?

Open 1024er opened this issue 5 years ago • 0 comments

image

I am not familiar with pytorch's DistributedDataParallel, and I am confused that why only teacher_model is applied DistributedDataParallel in general_distill.py ?

1024er avatar Mar 29 '20 13:03 1024er