XNOR-Net-PyTorch
No speedup or memory savings on CIFAR10
I have played around with CIFAR10 and also done a bit of benchmarking. It seems BinOp has no noticeable effect on model size or inference speed compared to the NIN model without BinOp. I have tested on both CPU and GPU. I thought the saved model nin.pth.tar would shrink and inference would speed up significantly. Am I missing something? Does anyone else have this issue? Thanks.
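For reference, here is a minimal sketch of how one might measure the two things above (checkpoint size on disk and average CPU latency). `build_nin()` is a placeholder for constructing the NIN model from this repo, and the checkpoint path and `"state_dict"` key are assumptions, not verified against the training script:

```python
import os
import time
import torch

# Illustrative sketch: measure checkpoint size on disk and average CPU latency.
# build_nin() is a placeholder for constructing the NIN model from this repo;
# "nin.pth.tar" and the "state_dict" key are assumptions about the checkpoint layout.
model = build_nin()
model.load_state_dict(torch.load("nin.pth.tar", map_location="cpu")["state_dict"])
model.eval()

print("checkpoint: %.1f MB" % (os.path.getsize("nin.pth.tar") / 1e6))

x = torch.randn(1, 3, 32, 32)      # one CIFAR-10 sized input
with torch.no_grad():
    for _ in range(10):            # warm-up
        model(x)
    start = time.time()
    for _ in range(100):
        model(x)
    print("avg latency: %.2f ms" % ((time.time() - start) / 100 * 1000))
```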
It’s because BinOp still stores the binarized weights as 32- or 16-bit floats. There is no bit-packing optimization here. Sad.
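To illustrate what is missing: after binarization each weight only carries its sign, so 8 weights could share one byte (roughly a 32x reduction versus fp32), but that requires packing the signs into integer words. A hedged sketch of what such packing could look like (illustrative only, not part of this repo):

```python
import torch

# Illustrative sketch of bit-packing: each binarized weight carries only its sign,
# so 8 signs fit in one byte (~32x smaller than storing them as fp32).
def pack_signs(weight: torch.Tensor) -> torch.Tensor:
    bits = (weight.reshape(-1) >= 0).to(torch.uint8)     # 1 for +1, 0 for -1
    pad = (-bits.numel()) % 8
    bits = torch.cat([bits, bits.new_zeros(pad)])        # pad to a multiple of 8
    bits = bits.reshape(-1, 8)
    shifts = torch.arange(8, dtype=torch.uint8)
    return (bits << shifts).sum(dim=1).to(torch.uint8)   # one byte per 8 weights

w = torch.randn(192, 160, 1, 1)                          # a conv layer's weights
packed = pack_signs(w)
print(w.numel() * 4, "bytes as fp32 vs", packed.numel(), "bytes packed")
```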
@fenollp Thanks for your reply. What would you suggest doing if I wanted to achieve the binary optimization? Modify the PyTorch core?
@guangzhili There is a useful discussion here, and a GPU kernel based on TensorFlow can be found here. But unfortunately, the acceleration is NOT significant. As for compression, I think it is easy to implement.
@guangzhili An XNOR operation kernel is required to get acceleration.
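The idea behind such a kernel, as a hedged sketch in plain Python (nothing here is from this repo): once weights and activations are both packed bit-vectors, a dot product reduces to XNOR followed by popcount, which is the operation a dedicated CPU/GPU kernel would vectorize:

```python
# Illustrative sketch (not from this repo): a binary dot product on 64-bit packed
# words via XNOR + popcount. A real kernel would do this in C/CUDA over whole
# convolutions, which is where the XNOR-Net speedup is supposed to come from.
def binary_dot(packed_a, packed_b, n_bits):
    matches = 0
    for wa, wb in zip(packed_a, packed_b):
        xnor = ~(wa ^ wb) & 0xFFFFFFFFFFFFFFFF   # 1 where the sign bits agree
        matches += bin(xnor).count("1")
    matches -= 64 * len(packed_a) - n_bits        # ignore the padding bits counted above
    return 2 * matches - n_bits                   # dot product of two +/-1 vectors

# Example: sign vectors [+1,-1,+1] and [+1,+1,-1] -> dot product = -1
print(binary_dot([0b101], [0b110], n_bits=3))
```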
I am implementing the kernels as part of my research project. I will release the code after I get the paper published somewhere.
Thank you @cow8, that's very helpful!
@jiecaoyu Sounds great. Good luck with the paper.
@guangzhili Looking forward to seeing your paper, good luck!
My NIN model without BinOp is 946.7 KB, while the model with BinOp is 3.9 MB. That's weird. @jiecaoyu
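One hedged way to find where the extra size comes from is to list what the checkpoint actually stores; if it keeps optimizer state or full-precision weight copies alongside the model, the file grows instead of shrinking. A sketch (the path and the assumption that the file is a dict of possibly nested tensors are mine, not verified against this repo):

```python
import torch

# Illustrative sketch: print every tensor stored in the checkpoint and its size,
# to see which entries make the BinOp checkpoint larger.
ckpt = torch.load("nin.pth.tar", map_location="cpu")

def tensor_bytes(obj, prefix=""):
    if torch.is_tensor(obj):
        size = obj.numel() * obj.element_size()
        print("%-60s %8.1f KB" % (prefix, size / 1024))
        return size
    if isinstance(obj, dict):
        return sum(tensor_bytes(v, f"{prefix}.{k}" if prefix else str(k)) for k, v in obj.items())
    if isinstance(obj, (list, tuple)):
        return sum(tensor_bytes(v, f"{prefix}[{i}]") for i, v in enumerate(obj))
    return 0

print("total tensor bytes: %.1f MB" % (tensor_bytes(ckpt) / 1e6))
```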