powersgd
powersgd copied to clipboard
Pull request for follow-up work
I am submitting a PR because I hope my work being included in the selected follow-up work of PowerSGD :) This is a pull request for follow-up work, Optimus-CC (ASPLOS'23), which utilizes PowerSGD for gradient compression in 3D parallelism-based LLM training.