Multi-GPU training is very slow

Open zscwind opened this issue 2 years ago • 2 comments

I used 4 GPUs on 1 node: torchrun --standalone --nproc_per_node=4 train.py --compile=False But the training speed is about the same as with 1 GPU. Why?

zscwind avatar Feb 08 '23 14:02 zscwind
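
One way to check whether the 4-GPU run is really no faster is to compare tokens processed per second rather than time per iteration, since under data-parallel training each iteration may cover more data even when it takes the same wall-clock time. The following is a minimal sketch, assuming every process handles batch_size * block_size tokens per micro-step. The helper name and the sample numbers are hypothetical; they are not taken from train.py or from the run reported above.

```python
def tokens_per_second(batch_size, block_size, grad_accum_steps, world_size, iter_time_s):
    """Tokens processed per second across all processes (hypothetical helper)."""
    if iter_time_s <= 0:
        raise ValueError("iter_time_s must be positive")
    # Assumption: each process sees batch_size * block_size tokens per micro-step.
    tokens_per_iter = batch_size * block_size * grad_accum_steps * world_size
    return tokens_per_iter / iter_time_s


# Made-up numbers: the same 0.5 s per iteration on 1 GPU and on 4 GPUs.
single = tokens_per_second(12, 1024, 1, 1, 0.5)
multi = tokens_per_second(12, 1024, 1, 4, 0.5)
print(f"speedup in tokens/s: {multi / single:.1f}x")
```

If the tokens-per-second figure does not grow with the number of GPUs either, the slowdown is real, and the usual suspects are communication overhead or data loading.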

Hello, have you solved it? I have the same problem.

han508 avatar Mar 31 '23 08:03 han508

Check => https://pytorch.org/docs/stable/amp.html

sertannavdann avatar Mar 31 '23 20:03 sertannavdann