mxnet-SSH
training speed
I ran the training code, but it is quite slow: only about 20 images/sec, whether I set batch_image = 32 on 1 GPU or use 4 GPUs with an even larger value. Do you have any suggestions?
The CPU is the performance bottleneck. We may need a refined C++ data loader to boost the speed.
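One way to check whether the loader is the limit on your own machine is to time the data iterator by itself, with no forward or backward pass. The snippet below is only a rough sketch: `loader_images_per_sec` and `fake_loader` are hypothetical helpers, not part of this repository, and `fake_loader` uses a sleep to stand in for CPU-side decoding and augmentation.

```python
import time


def loader_images_per_sec(batches, batch_size, max_batches=50):
    """Iterate a batch iterator alone and return its throughput in images/sec.

    `batches` is any iterable that yields one batch per step (assumption:
    every batch holds `batch_size` images).
    """
    start = time.perf_counter()
    n = 0
    for _ in batches:
        n += 1
        if n >= max_batches:
            break
    elapsed = time.perf_counter() - start
    if n == 0 or elapsed <= 0:
        return 0.0
    return (n * batch_size) / elapsed


def fake_loader(num_batches, delay_s):
    """Hypothetical stand-in for the real training iterator."""
    for i in range(num_batches):
        time.sleep(delay_s)  # simulates CPU-side decode/augmentation cost
        yield i


if __name__ == "__main__":
    rate = loader_images_per_sec(fake_loader(20, 0.05), batch_size=32)
    print(f"loader alone: {rate:.1f} images/sec")
```

If the number measured on the real iterator is close to the end-to-end training speed, the GPUs are probably waiting on the CPU, and adding GPUs or raising the batch size is unlikely to help.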