Dinghao Zhou

Results 152 comments of Dinghao Zhou

> Is there any solution to fix it? Maybe related issues: https://github.com/pytorch/pytorch/issues/27971#issuecomment-543067718

closed due to not activating for long time

reopen this issue if there are still problems

may some oom occurs in training, ```bash num workers * gpus

Are there any relevant references? thx

贴class下边 好奇完整的epoch跑完会咋样 这个作用是加速收敛呢 还是最终效果也会变好

后边会支持模型并行, moe这里需要特殊的处理, 看到了这个 参考下 截个图放这里 ref:https://zhuanlan.zhihu.com/p/681154742