Xiao-Yun Zhou

Results 2 issues of Xiao-Yun Zhou

In the paper, it is said alpha should be 0.99 at the beginning (when global_step is small) and should be 0.999 at the end (when global_step is large), however, in...

Hi, Thank you for contributing this fantastic code. It helps me a lot. However, as I am learning deeper and deeper, I found that the "wd, bd, weight, and bias"...

bug
help wanted