Xiao-Yun Zhou
Results
2
issues of
Xiao-Yun Zhou
In the paper, it is said alpha should be 0.99 at the beginning (when global_step is small) and should be 0.999 at the end (when global_step is large), however, in...
Hi, Thank you for contributing this fantastic code. It helps me a lot. However, as I am learning deeper and deeper, I found that the "wd, bd, weight, and bias"...
bug
help wanted