LookaheadOptimizer-mx icon indicating copy to clipboard operation
LookaheadOptimizer-mx copied to clipboard

Does this implementation maintain the momentum?

Open zhangtj1996 opened this issue 5 years ago • 1 comments

For optimizers like sgd+momentum, adam, rmsprop, they may use the historical information of the gradients. Does this implementation maintain / reset / interpolate the momentum in each outer loop?

zhangtj1996 avatar Dec 13 '19 09:12 zhangtj1996

Thank you for pointing it out!

This implementation doesn't reset the momentum in outer loop. I will try to fix it.

wkcn avatar Jun 28 '20 14:06 wkcn