Issues by Jan Červenka (1 result)
In the gradient descent with momentum formula, I think the gradient function $L^\prime$ should use $x^{[t-1]}$ to compute the new delta. $$ \Delta_x^{[t]} = lr \cdot L^\prime(x^{[t-1]}) + p \cdot...
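
For illustration, here is a minimal Python sketch of the update this comment describes. The formula above is cut off, so the sketch assumes the momentum term is the usual $p \cdot \Delta_x^{[t-1]}$ and that the delta is subtracted from $x$; that may not match the original text exactly. The function name `momentum_descent`, the quadratic example loss, and the default values of `lr`, `p`, and `steps` are all hypothetical.

```python
def momentum_descent(grad, x0, lr=0.1, p=0.9, steps=200):
    """Minimise a 1-D function given its derivative `grad` (assumed update rule)."""
    x = x0
    delta = 0.0  # no accumulated step before the first iteration
    for _ in range(steps):
        # `x` still holds the previous iterate x^{[t-1]} here, so the
        # gradient is evaluated at x^{[t-1]}, as the comment suggests.
        delta = lr * grad(x) + p * delta  # assumed momentum term: p * previous delta
        x = x - delta                     # assumed sign convention: subtract the step
    return x


def quadratic_grad(x):
    """Derivative of the example loss L(x) = (x - 3)^2, whose minimum is at x = 3."""
    return 2.0 * (x - 3.0)


x_min = momentum_descent(quadratic_grad, x0=0.0)
print(x_min)  # should be close to 3
```

The point of the sketch is the ordering inside the loop: the gradient is taken at the value of `x` from the previous step, and only afterwards is `x` updated with the new delta.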