EfficientNet-PyTorch
EfficientNet-PyTorch copied to clipboard
Gradient Checkpointing
Does anyone have any thoughts on where and how to add gradient checkpoints in this implementation to reduce memory usage during training?