mongeoroo
Results
1
comments of
mongeoroo
when training, here happens 'Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to'
I also experienced the same warning. But even with the warning, the network was trained well and the test accuracy is the same as the figure reported in the paper.