DeepSpeed icon indicating copy to clipboard operation
DeepSpeed copied to clipboard

Issue in multi-node training with Slurm

Open macabdul9 opened this issue 2 years ago • 0 comments

I am trying to train models on multiple nodes with DeepSpeed. Any resource for that?

Seems like this PR #2404 was merged into the main but can't find any documentation on how to use it. Kindly help. cc: @tjruwase @RezaYazdaniAminabadi @HeyangQin

macabdul9 avatar May 01 '23 13:05 macabdul9