Yashovardhan Chaturvedi

Results 1 comments of Yashovardhan Chaturvedi

https://stats.stackexchange.com/questions/284712/how-does-the-l-bfgs-work/285106 . I think since we are using L-BFGS we should not be calling optimizer.zero_grad() after each minibatch and let it accumulated for several minibatch and than do the update...