Simiao Zuo

Results: 1 comment by Simiao Zuo

I believe the default parameters are batch size 2048, warmup 4000, and learning rate 2.0. In my case, I set batch size 128, warmup 128000, and learning rate 8.0. It...
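To make the interaction between the warmup and learning rate values concrete, here is a minimal sketch. It assumes the inverse-square-root ("Noam") warmup schedule from the original Transformer paper, with the learning rate value acting as a multiplicative factor. The comment does not say which schedule or library is in use, so the function `noam_lr` and the `d_model=512` default are hypothetical and for illustration only.

```python
def noam_lr(step: int, factor: float, warmup: int, d_model: int = 512) -> float:
    """Learning rate at a given step under an ASSUMED Noam-style schedule.

    lr = factor * d_model^-0.5 * min(step^-0.5, step * warmup^-1.5)

    The rate rises linearly for `warmup` steps, then decays as 1/sqrt(step).
    `d_model=512` is an assumption, not a value taken from the comment.
    """
    step = max(step, 1)  # avoid division by zero at step 0
    return factor * d_model ** -0.5 * min(step ** -0.5, step * warmup ** -1.5)


# The two settings mentioned in the comment (batch size does not enter the formula).
default_cfg = {"factor": 2.0, "warmup": 4000}
custom_cfg = {"factor": 8.0, "warmup": 128000}

for name, cfg in (("default", default_cfg), ("custom", custom_cfg)):
    # Under this schedule the peak is reached exactly at step == warmup.
    peak = noam_lr(cfg["warmup"], **cfg)
    print(f"{name}: peak lr {peak:.6f} at step {cfg['warmup']}")
```

Under this assumed schedule, the peak rate scales as factor / sqrt(warmup), so a longer warmup lowers the peak unless the factor is raised to compensate.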