geeps icon indicating copy to clipboard operation
geeps copied to clipboard

how to enable multi-gpu training in each node

Open Dav-Jay opened this issue 8 years ago • 2 comments

Thanks for the excellent work!

When I tried the provided train_inception.sh for CIFAR10 on two nodes, I found only one GPU is invoked in each node (GPU 0), is there a way to use multiple gpus in each node? In original Caffe, I can use --gpu=0,1,2,3 which seems not work in geeps.

Dav-Jay avatar Dec 01 '16 15:12 Dav-Jay

Seems current geeps does not support multi-gpu in each node?

AIROBOTAI avatar Dec 02 '16 08:12 AIROBOTAI

Unfortunately, the current GeePS implementation does not support multi-gpu in each node. Feel free to implement that and create a pull request :)

cuihenggang avatar Dec 14 '16 19:12 cuihenggang