Mostafa Elhoushi comments

Results: 22 comments of Mostafa Elhoushi

Add batchnorm parameters: running_mean, running_var, and others

I have created a pull request to address this issue: https://github.com/sksq96/pytorch-summary/pull/88

count running_mean and running_var params for BN

Thanks @Naireen. According to the link you provided, I am quoting the following: > Also by default, during training this layer keeps running estimates of its computed mean and...
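To make the distinction in this thread concrete, here is a minimal sketch of how a BatchNorm layer's learnable parameters and its running-statistic buffers can be counted separately in PyTorch. The helper name `count_elements` and the choice of 16 channels are assumptions made for illustration; this is not necessarily how the pull request above counts them.

```python
import torch.nn as nn


def count_elements(module: nn.Module) -> dict:
    """Count learnable parameters and non-learnable buffers separately.

    Hypothetical helper for illustration only.
    """
    # weight (gamma) and bias (beta) are registered as parameters
    learnable = sum(p.numel() for p in module.parameters() if p.requires_grad)
    # running_mean, running_var and num_batches_tracked are registered as buffers
    buffers = sum(b.numel() for b in module.buffers())
    return {"learnable": learnable, "buffers": buffers}


bn = nn.BatchNorm2d(16)  # 16 channels is an arbitrary example size
counts = count_elements(bn)
buffer_names = [name for name, _ in bn.named_buffers()]

print(counts)
print(buffer_names)
```

Because the running statistics are buffers rather than parameters, a summary that iterates only over `module.parameters()` will not include them.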

Log Message for each Pass if it Changed the Graph

I am willing to implement this feature, but couldn't assign the issue to myself.

Low training accuracy using pre-trained model

Did you try to train for one epoch with learning rate = 0 (`--lr 0`)?

What accuracy should we expect when training Alexnet from scratch on ImageNet?

Just quoting from this [blog article](https://blog.paperspace.com/popular-deep-learning-architectures-alexnet-vgg-googlenet/): > The model uses a stochastic gradient descent optimization function with batch size, momentum, and weight decay set to 128, 0.9, and 0.0005 respectively....
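As a rough illustration of how the quoted momentum and weight decay values enter an update, the sketch below applies one SGD step to a single scalar weight. It assumes the common formulation in which weight decay is added to the gradient before the momentum update; the original AlexNet update rule is written slightly differently. The function name `sgd_step`, the learning rate, and the example numbers are hypothetical and do not come from the article.

```python
def sgd_step(w, grad, velocity, lr, momentum=0.9, weight_decay=0.0005):
    """One SGD step with momentum and L2 weight decay on a scalar weight.

    Hypothetical helper; momentum and weight_decay defaults are the
    values quoted above, everything else is an illustrative assumption.
    """
    # L2 weight decay contributes weight_decay * w to the gradient
    g = grad + weight_decay * w
    # The velocity accumulates a decaying sum of past gradients
    new_velocity = momentum * velocity + g
    # Move against the accumulated direction, scaled by the learning rate
    new_w = w - lr * new_velocity
    return new_w, new_velocity


# Example values are arbitrary; lr=0.01 is an assumption.
w1, v1 = sgd_step(w=1.0, grad=0.5, velocity=0.0, lr=0.01)

# With a learning rate of 0 the weight is left unchanged.
w_frozen, _ = sgd_step(w=1.0, grad=0.5, velocity=0.0, lr=0.0)
```

The zero learning rate case is the same idea as the `--lr 0` check suggested in the earlier comment: if accuracy still moves during such a run, the change is coming from something other than the weight update.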

What accuracy should we expect when training Alexnet from scratch on ImageNet?

Maybe try those hyperparameters, and if they lead to the expected accuracy, perhaps create a pull request to update the README file accordingly?

What accuracy should we expect when training Alexnet from scratch on ImageNet?

> So far our tests aren't in a place where we can guarantee some model performance, the case could be made that maybe we should? But so far we don't...

4-bits on ResNet18 results in 6% reduction in error

First of all, thanks for your quick response, I highly appreciate that! I originally tried with the `epochs` setting kept at 4. I changed it to 5 and am waiting for the...

4-bits on ResNet18 results in 6% reduction in error

Thanks Maxim for your detailed response. I have also tried training with 5 epochs per iteration and used the batch size and other hyperparameters they mentioned in their paper for...

4-bits on ResNet18 results in 6% reduction in error

> Hi @mostafaelhoushi, were you able to solve the problem in the end or no? > I am not able to reproduce the results for 2 bits weight for ResNet18....