Mostafa Badawy comments

Repositories
Issues
Comments

Results 5 comments of


                                            Mostafa Badawy

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation

Which Pytorch version do you use?

Number of flops

Because my implementation of group convolution is based on straightforward tensorflow graph operations. However, I think the authors used a fast cuDNN implementation based on caffe.

Number of flops

You're welcome. Also, note that the group convolution operator hasn't been implemented yet officially in Tensorflow.

Number of flops

@Pelups I have an update to this. I found out that the authors have counted mult-add as 'one' unit not as 'two' units. So, the number I have achieved if...

How's the ImageNet training

Sorry for the late reply. I've trained on TinyImageNet-200 and the loss was very small after converging. On ImageNet, I stopped the training because I was very busy doing my...