1 issue by Yichun Shi

Thanks for your contribution! I found some problems when using the code: 1. The learning rate is supposed to decay to 0.01 × initial_lr at the end, but it barely changed...
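
For reference, here is a minimal sketch of a schedule that ends at 0.01 × initial_lr. It assumes exponential decay over a known number of steps; the function name `decayed_lr` and the parameters `total_steps` and `final_ratio` are hypothetical and not taken from the repository, whose actual schedule may differ.

```python
def decayed_lr(initial_lr, step, total_steps, final_ratio=0.01):
    """Exponentially decay from initial_lr to final_ratio * initial_lr.

    Hypothetical helper for illustration; not the repository's code.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Fraction of training completed, clamped to [0, 1].
    progress = min(max(step / total_steps, 0.0), 1.0)
    # final_ratio ** 0 == 1 at the start, final_ratio ** 1 at the end.
    return initial_lr * final_ratio ** progress


if __name__ == "__main__":
    for step in (0, 250, 500, 750, 1000):
        print(step, decayed_lr(0.1, step, total_steps=1000))
```

Logging the value at a few steps like this is a quick way to check whether a schedule is actually being applied during training.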