large-batch-training

Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"

7 large-batch-training issues

It seems that Keras has changed the parameters of the `BatchNormalization` function. The error message: ``` Traceback (most recent call last): File "plot_parametric_plot.py", line 64, in <module> model = network_zoo.shallownet(nb_classes) File...

Hi Nitish, can you release the code for computing sharpness? I want to use the metric in my paper.

Hi Nitish, could you include the code for computing sharpness as well? Thanks, Neelesh

Hi @keskarnitish, I would like to ask how large a "small batch" and a "large batch" are in real problems. For example, in...

Is there any problem that makes a GPU implementation difficult? I tried to compute a linear combination of the SB weights and LB weights in GPU mode, and got...
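
For reference, the linear combination of SB (small-batch) and LB (large-batch) weights mentioned in this issue can be sketched on the CPU with NumPy. This is only an illustrative sketch, not the repository's code: `interpolate_weights`, `sb_weights`, and `lb_weights` are hypothetical names, the weights are assumed to be lists of per-layer arrays (for example, as returned by Keras `model.get_weights()`), and the range of `alpha` is an assumption chosen for illustration.

```python
import numpy as np


def interpolate_weights(sb_weights, lb_weights, alpha):
    """Return alpha * LB + (1 - alpha) * SB, computed layer by layer.

    sb_weights, lb_weights: lists of NumPy arrays with matching shapes
    (hypothetical inputs; one array per layer parameter).
    """
    if len(sb_weights) != len(lb_weights):
        raise ValueError("SB and LB weight lists must have the same length")
    return [alpha * lb + (1.0 - alpha) * sb
            for sb, lb in zip(sb_weights, lb_weights)]


# Toy stand-ins for two trained solutions (assumed shapes, not real weights).
sb = [np.zeros((2, 2)), np.zeros(2)]
lb = [np.ones((2, 2)), np.ones(2)]

# alpha = 0 recovers the SB weights, alpha = 1 recovers the LB weights;
# values outside [0, 1] extrapolate along the same line.
alphas = np.linspace(-1.0, 2.0, 25)
mid = interpolate_weights(sb, lb, 0.5)
```

In a Keras workflow, each interpolated list could be passed to `model.set_weights(...)` before evaluating the loss, so that only the evaluation step itself runs on the GPU.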

I ran `python plot_parametric_plot.py -n C1` and got the following error: ``` Traceback (most recent call last): File "plot_parametric_plot.py", line 64, in <module> model = network_zoo.shallownet(nb_classes) File "/home//github/users/wenwei202/large-batch-training/network_zoo.py", line 37, in shallownet...

Is there a Caffe implementation? Thanks!