faster-pytorch-blog icon indicating copy to clipboard operation
faster-pytorch-blog copied to clipboard

python 7_fabric.py no speedup (4 RTX 3090)

Open kli-casia opened this issue 2 years ago • 3 comments

image

Thanks~

kli-casia avatar Apr 27 '23 08:04 kli-casia

Hm, what's your baseline speed on a single GPU? And how many workers are you using in the dataloader?

rasbt avatar Apr 27 '23 13:04 rasbt

On a single GPU, the time is ~ 22 min. By the way, I git clone your code without changing a single line of code.
Thanks

kli-casia avatar Apr 28 '23 01:04 kli-casia

I see, that's weird. So basically you only get 18 min for 4 GPUs, where you get 22 min on a single GPU? That's definitely weird, I don't think I have a good explanation for this. Maybe one GPU was busy running something else at the same time, and it slowed down everything because the other GPUs had to wait for the sync step?

rasbt avatar May 04 '23 21:05 rasbt