PyTorch-LBFGS
A PyTorch implementation of L-BFGS.
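For orientation, here is a minimal sketch of the standard L-BFGS two-loop recursion, which turns a gradient and a short history of curvature pairs into a search direction. It is written in plain Python lists rather than PyTorch tensors and is not taken from this repository; the names `dot`, `two_loop_direction`, `s_hist`, and `y_hist` are hypothetical and chosen only for illustration.

```python
def dot(a, b):
    """Inner product of two equal-length sequences of floats."""
    return sum(x * y for x, y in zip(a, b))


def two_loop_direction(grad, s_hist, y_hist):
    """Approximate -H * grad with the L-BFGS two-loop recursion.

    Assumed conventions (illustrative, not this repository's API):
    s_hist[i] = x_{i+1} - x_i and y_hist[i] = g_{i+1} - g_i, oldest pair first,
    with every pair satisfying the curvature condition dot(y, s) > 0.
    """
    q = list(grad)
    coeffs = []
    # First loop: walk the history from the newest pair to the oldest.
    for s, y in zip(reversed(s_hist), reversed(y_hist)):
        rho = 1.0 / dot(y, s)
        alpha = rho * dot(s, q)
        q = [qi - alpha * yi for qi, yi in zip(q, y)]
        coeffs.append((rho, alpha))
    # Scale by gamma = s^T y / y^T y from the newest pair (identity if no history).
    if s_hist:
        gamma = dot(s_hist[-1], y_hist[-1]) / dot(y_hist[-1], y_hist[-1])
    else:
        gamma = 1.0
    r = [gamma * qi for qi in q]
    # Second loop: walk the history from the oldest pair to the newest.
    for (s, y), (rho, alpha) in zip(zip(s_hist, y_hist), reversed(coeffs)):
        beta = rho * dot(y, r)
        r = [ri + (alpha - beta) * si for ri, si in zip(r, s)]
    # Negate to obtain a descent direction.
    return [-ri for ri in r]
```

With an empty history the sketch falls back to steepest descent, so `two_loop_direction(g, [], [])` returns the negated gradient. A practical optimizer would also cap the history length and skip pairs that violate the curvature condition.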
Updated line 257 to remove the warning: overload of add_ is deprecated
Hi Michael, I wanted to share a stochastic first order optimizer idea and implementation for L-BFGS, in case you might find it interesting or even practical in your research (congrats...
Dheevatsa, have you seen any errors of the following sort? ``` Traceback (most recent call last): File "lbfgs_tests.py", line 114, in obj, grad, lr, backtracks, clos_evals, grad_evals, fail = optimizer.step(p,...
This question has been bothering me for a long time: how do I determine the direction of the gradient?
Hello @hjmshi, recently I stumbled upon [the Bits&Bytes wrapper](https://gist.github.com/TimDettmers/c4ffe346f095ee4481aa3d4b4ad2ffe0). Since then I can't help but wonder whether we could adapt this idea for second-order methods, namely, 8-bit...
I'm trying FullBatchLBFGS with Wolfe line search on a fitting task with a small dataset. 1. Levenberg-Marquardt is giving me fairly good accuracy. Can I expect LBFGS to provide similar accuracy as...
The error occurs in functions/LBFGS.py, line 854. I think it comes from t, which becomes a double-precision number as the optimization converges. I tried ``` if(F_new >...
Thanks for this implementation. Recently, I've been working on a package to use machine learning for chemistry problems where I use pytorch to train some models. I have been able...