Deep Gradient Compression
Partial implementation of the paper "Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training".
Installation
To install the required packages, run:
pip3 install -r requirements.txt
Run project
python main.py
Implementation
The current implementation covers only the following parts of the paper:
- large gradients selection and update
- small gradients accumulation
- momentum correction
- momentum factor masking
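The four pieces listed above can be sketched together as one local update step. This is a minimal illustration written in plain Python on flat lists, not the code from this repository: the function name `dgc_step` and the parameters `momentum` and `keep_ratio` are hypothetical, and a real implementation would operate on PyTorch tensors per layer.

```python
def dgc_step(grad, velocity, accum, momentum=0.9, keep_ratio=0.25):
    """One local compression step on flat lists of floats (illustrative only).

    Updates `velocity` and `accum` in place and returns the sparse update
    to communicate, as a dict mapping index -> value.
    """
    n = len(grad)

    # Momentum correction: accumulate the momentum-adjusted velocity
    # rather than the raw gradient.
    for i in range(n):
        velocity[i] = momentum * velocity[i] + grad[i]
        accum[i] += velocity[i]

    # Large gradient selection: pick the k entries of largest magnitude.
    k = max(1, int(n * keep_ratio))
    top = sorted(range(n), key=lambda i: abs(accum[i]), reverse=True)[:k]

    sparse_update = {}
    for i in top:
        sparse_update[i] = accum[i]
        # Small gradient accumulation: unsent entries stay in `accum`;
        # only the sent entries are cleared.
        accum[i] = 0.0
        # Momentum factor masking: reset the velocity of sent entries
        # so stale momentum does not carry over.
        velocity[i] = 0.0
    return sparse_update


velocity = [0.0] * 4
accum = [0.0] * 4
update = dgc_step([0.1, -2.0, 0.3, 0.05], velocity, accum)
print(update)  # {1: -2.0}
```

In this sketch only the selected entries would be exchanged between workers; everything else keeps accumulating locally until it grows large enough to be selected.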
References
- Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
- PyTorch tutorial on distributed training