
Acceleration Training

Open ZouhairBALLOUCH opened this issue 3 years ago • 5 comments

Hi, is there any way to speed up training? For me, each epoch takes more than 3 hours.
Thank you very much in advance.

ZouhairBALLOUCH avatar Jul 13 '21 06:07 ZouhairBALLOUCH

Hi, did you solve your problem? I ran into a similar one: I could only run the code on 1 GPU, and GPU memory usage was only 161 MB. Training 100 epochs took 4 full days, almost 100 hours in total.

R1CH4RDL avatar Jul 15 '21 03:07 R1CH4RDL

Hi, not yet. Could you please share the pre-trained model you obtained? Best regards

ZouhairBALLOUCH avatar Jul 19 '21 14:07 ZouhairBALLOUCH

I had the same problem at one point. The CUDA library was not loaded, but no warning or error was shown, and the program simply continued on the CPU only. However, the issue did not appear again the next day. I have no idea what caused it.

xianyu-wang avatar Mar 03 '22 06:03 xianyu-wang

> Hi, did you solve your problem? I ran into a similar one: I could only run the code on 1 GPU, and GPU memory usage was only 161 MB. Training 100 epochs took 4 full days, almost 100 hours in total.

Hello, I ran into a similar problem. Could you please share the pre-trained model you obtained? Best regards

helloCZZ avatar Mar 23 '22 12:03 helloCZZ

Hi, I had the same issue. I used Colab Pro+, which has a Tesla V100, but it did not use the GPU for training either, so every epoch also took me 3 hours. Then I tried GPUSHARE, which is similar to AWS. My VM has a 3090 and an AMD EPYC 7601, and every epoch took 1.5 hours, which is quite decent so far, though it is still training on the CPU.

AlberTgarY avatar Jun 21 '22 04:06 AlberTgarY
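
Several replies above describe training that silently fell back to the CPU because the CUDA library was not loaded. A pre-flight check run before training can make that failure visible. The following is a minimal sketch, not part of the SensatUrban code: the function names are hypothetical, the candidate library file names are assumptions that depend on the installed driver and CUDA toolkit, and it assumes an NVIDIA GPU with `nvidia-smi` on the `PATH`.

```python
import ctypes
import shutil
import subprocess

# Assumed library names; the real file names vary with the driver and
# CUDA toolkit version installed on the machine.
CANDIDATE_LIBS = ("libcuda.so.1", "libcudart.so", "nvcuda.dll")


def loadable_cuda_libs(names=CANDIDATE_LIBS):
    """Return the subset of `names` that the dynamic loader can open."""
    found = []
    for name in names:
        try:
            ctypes.CDLL(name)
        except OSError:
            # Library missing or not loadable on this machine.
            continue
        found.append(name)
    return found


def query_gpus():
    """Return one 'name, memory.used' line per GPU, or [] if the query fails."""
    exe = shutil.which("nvidia-smi")
    if exe is None:
        return []
    try:
        result = subprocess.run(
            [exe, "--query-gpu=name,memory.used", "--format=csv,noheader"],
            capture_output=True,
            text=True,
            timeout=30,
        )
    except (OSError, subprocess.TimeoutExpired):
        return []
    if result.returncode != 0:
        return []
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


def require_gpu():
    """Raise instead of letting training continue on the CPU unnoticed."""
    libs = loadable_cuda_libs()
    gpus = query_gpus()
    if not libs or not gpus:
        raise RuntimeError(
            f"GPU check failed: loadable CUDA libraries={libs}, visible GPUs={gpus}"
        )
    return libs, gpus


if __name__ == "__main__":
    libs, gpus = require_gpu()
    print("CUDA libraries:", libs)
    print("GPUs:", gpus)
```

Passing this check only shows that the driver and a CUDA library are reachable from Python. It does not prove that the training framework itself places operations on the GPU, so the framework's own device listing is still worth checking. Re-running `query_gpus()` during training and comparing the reported memory use is one way to spot a case like the 161 MB figure mentioned above.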