open_clip
open_clip copied to clipboard
How to estimate the memory usage of a gpu under the global loss setting?
I have noticed that under the global loss setting, when training with multiple nodes, the GPU memory usage will be much higher than when training with a single node. So, how should we estimate the GPU memory usage with multiple nodes in this situation?