Cesc Chunseong Park
Results
3
comments of
Cesc Chunseong Park
3 stages mean 15 epochs, that's enough I think. It would be nice to test it by reducing the size of the model.
Did you set the same hyper-parameter settings as in the paper?
The code is for demonstrating examples. To make it work faster, there are smaller hyper-parameter values than the numbers in the paper. ex) TOPK = 3