Yashovardhan Chaturvedi
Results
1
comments of
Yashovardhan Chaturvedi
https://stats.stackexchange.com/questions/284712/how-does-the-l-bfgs-work/285106 . I think since we are using L-BFGS we should not be calling optimizer.zero_grad() after each minibatch and let it accumulated for several minibatch and than do the update...