Dinghao Zhou
> Is there any solution to fix it? Maybe related issues: https://github.com/pytorch/pytorch/issues/27971#issuecomment-543067718
Closed due to a long period of inactivity.
Please reopen this issue if the problem persists.
Any update on this?
Is this supported yet?
OOM may occur during training, since the total worker count is `num workers * gpus`.
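
A rough sketch of why that product matters, assuming each GPU process starts its own data loader with `num workers` worker processes. The function names and the per-worker memory figure are hypothetical and only for illustration; actual memory use depends on the dataset and prefetch settings.

```python
def total_dataloader_workers(num_workers: int, gpus: int) -> int:
    """Worker processes on one node, assuming one data loader per GPU process."""
    return num_workers * gpus


def estimated_host_memory_gb(num_workers: int, gpus: int, per_worker_gb: float) -> float:
    """Crude host-memory estimate: every worker is assumed to hold per_worker_gb."""
    return total_dataloader_workers(num_workers, gpus) * per_worker_gb


if __name__ == "__main__":
    # Example values only: 8 workers per GPU process on an 8-GPU node.
    workers = total_dataloader_workers(num_workers=8, gpus=8)
    print(f"total worker processes: {workers}")
    print(f"estimated host memory: {estimated_host_memory_gb(8, 8, 1.5)} GB")
```

If the estimate exceeds the host RAM, lowering `num workers` is the first thing to try.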
Are there any relevant references? thx
Any update on this?
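Paste it below the class. I'm curious how it turns out after a full epoch finishes. Does this speed up convergence, or does it also improve the final result?
Model parallelism will be supported later, and MoE needs special handling here. I came across this for reference and am putting a screenshot here. ref: https://zhuanlan.zhihu.com/p/681154742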