cnn-models icon indicating copy to clipboard operation
cnn-models copied to clipboard

train on multi gpus

Open liyu10000 opened this issue 5 years ago • 2 comments

Hi,

I tried to finetune with your model. It works well on single gpu but not on multi gpus. Is it a inhere limitation or there is something I need to work around? Thanks.

liyu10000 avatar Feb 19 '19 09:02 liyu10000

Hi!

Which model are you using? Multi GPU shouldn't make a difference for VGG or AlexNet, but can make a difference for ResNet. For ResNet, please make sure that each GPU gets at least 8, better 16, images of each batch.

Another thing to take care of is the learning rate. It should be scale linearly with the batch size, which you probably scale when using multiple GPUs.

Am 19. Februar 2019 18:47:07 schrieb Li Yu [email protected]:

Hi, I tried to finetune with your model. It works well on single gpu but not on multi gpus. Is it a inhere limitation or there is something I need to work around? Thanks. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

MarcelSimon avatar Feb 21 '19 03:02 MarcelSimon

Thank you for your response. I finally found that there is something wrong with the environment. I tried to run it on another server and it worked like a charm.

liyu10000 avatar Feb 22 '19 03:02 liyu10000