Houmin Wei
Paper: https://zhuangwang93.github.io/docs/Gemini_SOSP23.pdf
SIGCOMM 2023 ref: https://dl.acm.org/doi/pdf/10.1145/3603269.3604823
Paper: https://qiangsu97.github.io/files/conext22-final43.pdf
Paper: https://www.usenix.org/conference/nsdi23/presentation/liu-kefei
Paper: https://arxiv.org/pdf/1904.03257.pdf
Paper:
- https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf
Video:
- https://www.bilibili.com/video/BV1YA4y197G8
Paper:
- https://arxiv.org/pdf/1909.08053.pdf
- https://arxiv.org/pdf/2104.04473.pdf
- https://arxiv.org/pdf/2205.05198.pdf
Reading by Mu Li (李沐):
- https://www.bilibili.com/video/BV1nB4y1R7Yz
Github:
- https://github.com/NVIDIA/Megatron-LM
Blog:
- https://huggingface.co/blog/zh/megatron-training
- https://juejin.cn/post/7064496967828635655
Slide from NVIDIA:
- https://developer.download.nvidia.cn/video/gputechconf/gtc/2020/presentations/s21496-megatron-lm-training-multi-billion-parameter-language-models-using-model-parallelism.pdf
Paper:
- https://arxiv.org/abs/1712.05889v2
- OSDI 2018 https://www.usenix.org/system/files/osdi18-moritz.pdf
Presentation:
- https://www.usenix.org/conference/osdi18/presentation/moritz
- https://www.youtube.com/watch?v=qD4KoeB0RiA
Paper: https://arxiv.org/abs/2202.07848
NSDI19: https://www.usenix.org/conference/nsdi19/presentation/shu