variance_reduced_neural_networks
variance_reduced_neural_networks copied to clipboard

Published 20 hours ago •

Reame
Issues

Why is the norm of the grad not used at all in the optimisation process of SAGA?

Open rohan1561 opened this issue 2 years ago • 0 comments

Where exactly is the equation (3) in the main paper implemented in the SAGA algorithm?

Sep 13 '22 19:09 rohan1561