Christian Lorentzen comments

Results 252 comments of


                                            Christian Lorentzen

FEA add Cholesky based Newton solver to GLMs

> > Note to myself: For tiny alpha like 1e-12, it raises LinAlgWarning: Ill-conditioned matrix and the results disagree a little bit. lbfgs seems to find a solution with just...

FEA add Cholesky based Newton solver to GLMs

> Here are some notes from a first quick code review. As discussed with @GaelVaroquaux and @agramfort it would be interesting how this solver compares with an equivalent of the...

FEA add Cholesky based Newton solver to GLMs

> With regards to Newton-cg vs Newton-cholesky: the difference should be marked in higher dimensions. It would be interesting to have a benchmark with p in the hundreds. X.shape =...

FEA add Cholesky based Newton solver to GLMs

With https://github.com/scikit-learn/scikit-learn/pull/23314/commits/e6684c6bd1c42074216fe6d8fe51f60caff05c2e, I completed the tests for unpenalized GLMs. Same as in #22910. Based on those, we can better investigate how to handle singular `X` with these Newton solvers.

FEA add Cholesky based Newton solver to GLMs

The last commits contain substantial updates: - https://github.com/scikit-learn/scikit-learn/pull/23314/commits/3fb36954b75492ee94f249ce0a6638cf48d2ed64 fixed the test around `glm_dataset`. The GLM tests now take between 1-2 minutes (on my laptop). - I tried an SVD based...

FEA add Cholesky based Newton solver to GLMs

The new tests are **very** hard. I had to switch of `TweedieRegressor(power=3.0)` and `TweedieRegressor(power=0, link="log")`. Furthermore, https://github.com/scikit-learn/scikit-learn/pull/23314/commits/c9b120063574bdd9d408805ce96c34ebe7722f81 and https://github.com/scikit-learn/scikit-learn/pull/23314/commits/2f0ea15a2089861170b744e8a531708100c9ff88 introduce a fallback inner solver to 4 lbfgs iterations. This is...

FEA add Cholesky based Newton solver to GLMs

Those are interesting findings. The real datasets used above all have several categorical values which produce diagonal sub-blocks of the hessian. It seems lbfgs is just slower on those and...

FEA add Cholesky based Newton solver to GLMs

I put the tests in a separate PR, #23619, to make this PR smaller once that is merged.

FEA add Cholesky based Newton solver to GLMs

I'll push some commits in a minute.

FEA add Cholesky based Newton solver to GLMs

@ogrisel I had a little time. Test tolerances are fixed in https://github.com/scikit-learn/scikit-learn/pull/23314/commits/5e6aa9974a862db813e1300d5cf0b47a6aed23a5. The fallback to lbfgs is done in ~~https://github.com/scikit-learn/scikit-learn/pull/23314/commits/5e6aa9974a862db813e1300d5cf0b47a6aed23a5~~ https://github.com/scikit-learn/scikit-learn/pull/23314/commits/d4206d68247ca1b439657be31ce9d86250a53d8a.