denoising-diffusion-pytorch icon indicating copy to clipboard operation
denoising-diffusion-pytorch copied to clipboard

Train on Multi-GPU

Open tsWen0309 opened this issue 10 months ago • 3 comments

When I try to train this model on multi-gpu using Accelerate, an error occurs image How to solve it and is there any default Accelerate config that I can refer to? Please excuse my foolishness.

tsWen0309 avatar Aug 09 '23 02:08 tsWen0309

@Flu0XeT1n want to try 1.8.9 and see if it works now?

lucidrains avatar Aug 09 '23 02:08 lucidrains

@Flu0XeT1n want to try 1.8.9 and see if it works now?

I tried 1.8.9. The original problem is solved but a new one occurs. image

tsWen0309 avatar Aug 09 '23 12:08 tsWen0309

that sounds like an error with the way you have cuda configured

lucidrains avatar Aug 09 '23 13:08 lucidrains