variance_reduced_neural_networks icon indicating copy to clipboard operation
variance_reduced_neural_networks copied to clipboard

Why is the norm of the grad not used at all in the optimisation process of SAGA?

Open rohan1561 opened this issue 2 years ago • 0 comments

Where exactly is the equation (3) in the main paper implemented in the SAGA algorithm?

rohan1561 avatar Sep 13 '22 19:09 rohan1561