ScaledYOLOv4 icon indicating copy to clipboard operation
ScaledYOLOv4 copied to clipboard

OMP_NUM_THREADS error.

Open saikrishnadas opened this issue 4 years ago • 5 comments

I get an error while trying to use distributed training. I have 4 GPUs(Tesla T4) and error shows when using a p7 model. Tried switching to single GPU and same error occurs. But it works with csp model with one gpu.

Error log : **Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.**

saikrishnadas avatar Dec 17 '20 04:12 saikrishnadas

this is not error message, all of ddp training will show this default message.

WongKinYiu avatar Dec 18 '20 00:12 WongKinYiu

But my training process is exited after showing this.

saikrishnadas avatar Dec 18 '20 05:12 saikrishnadas

Do you have a solution for this? This usually happens when I use more than one GPU with p7 model @WongKinYiu

saikrishnadas avatar Dec 22 '20 16:12 saikrishnadas

I also encounter with this problem, @saikrishnadas did you got any solution for this?

RohitKeshari avatar Aug 12 '21 13:08 RohitKeshari

My training isn't even killed. It just freezes before it begins.

marcelampc avatar Mar 02 '22 13:03 marcelampc