Ziyu Chen
Results
1
comments of
Ziyu Chen
The problem may come with DDP settings. The PyTorch DDP [notes about **Backward Pass**](https://pytorch.org/docs/master/notes/ddp.html) “so after the backward pass, the grad field on the same corresponding parameter across different DDP...