papers-notebook
ZeRO-Offload: Democratizing Billion-Scale Model Training
https://arxiv.org/abs/2101.06840
https://www.deepspeed.ai/news/2021/03/07/zero3-offload.html
My notes: https://github.com/Jack47/hack-SysML/blob/master/papers/ZeRO-Offload.md