Andrei Atanov

Results 3 comments of Andrei Atanov

Hi @utjune , sorry for the very late reply. Do you use dp or ddp?

does it only happen with 8 gpus, what if you use less gpus? what is your batch size?

Thanks for letting me know; this is weird; I'll try to test it on my end when I can access an 8-GPU node.