DisCo
DisCo copied to clipboard
How to use DeepSpeed for multi-GPU training instead of using mpirun?