LookaheadOptimizer-mx
Does this implementation maintain the momentum?
Optimizers like SGD with momentum, Adam, and RMSProp keep internal state built from historical gradient information (momentum and second-moment buffers). At each outer-loop synchronization in Lookahead, does this implementation maintain, reset, or interpolate that state?
Thank you for pointing it out!
This implementation doesn't reset the momentum in the outer loop. I will try to fix it.
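For reference, here is a minimal NumPy sketch, not this repository's code, of a Lookahead wrapper around SGD with momentum that marks where each of the three state-handling choices would go at the outer-loop synchronization point. All names here (`LookaheadSGD`, `state_mode`) are hypothetical, and the "interpolate" branch shows just one simple interpretation (blending the buffer toward zero with the same weight `alpha` used for the slow weights).

```python
# Hypothetical sketch of Lookahead over SGD+momentum; not this repo's API.
import numpy as np

class LookaheadSGD:
    def __init__(self, param, lr=0.1, momentum=0.9, k=5, alpha=0.5,
                 state_mode="reset"):
        self.param = param                     # fast weights, updated every step
        self.slow = param.copy()               # slow weights, updated every k steps
        self.velocity = np.zeros_like(param)   # inner SGD momentum buffer
        self.lr, self.momentum = lr, momentum
        self.k, self.alpha = k, alpha
        self.state_mode = state_mode
        self.step_count = 0

    def step(self, grad):
        # Inner update: plain SGD with momentum on the fast weights.
        self.velocity = self.momentum * self.velocity - self.lr * grad
        self.param += self.velocity
        self.step_count += 1

        if self.step_count % self.k == 0:
            # Outer update: slow weights move toward the fast weights,
            # then the fast weights are pulled back onto the slow weights.
            self.slow += self.alpha * (self.param - self.slow)
            self.param[...] = self.slow
            # Momentum handling at the synchronization point:
            if self.state_mode == "reset":
                self.velocity[...] = 0.0       # discard the stale buffer
            elif self.state_mode == "interpolate":
                self.velocity *= self.alpha    # shrink buffer toward zero
            # "maintain": leave velocity untouched — the behavior
            # this issue reports for the current implementation.
```

The "maintain" branch (doing nothing) is what the reply above describes: the momentum buffer still points along the last fast-weight trajectory even though the weights have just jumped to the interpolated slow weights, which is why resetting or interpolating the state at each outer step is worth considering.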