Andrei Atanov
Results
3
comments of
Andrei Atanov
Hi @utjune , sorry for the very late reply. Do you use dp or ddp?
does it only happen with 8 gpus, what if you use less gpus? what is your batch size?
Thanks for letting me know; this is weird; I'll try to test it on my end when I can access an 8-GPU node.