mos icon indicating copy to clipboard operation
mos copied to clipboard

Results 6 mos issues
Sort by recently updated
recently updated
newest added

Hi, When I try to reproduce the results, I find that the model converges ~100 epoch and the valid ppl is 65.32, which is much higher than the published results....

Hi Mr.Dai, It seems the performance discrepancy has lead to slightly worse performance on the penn treebank in the readme page. I am able to reproduce the result for the...

Hi, I was trying to run MoS on WikiText-103 and 1B word dataset. I wonder if you have used adaptive softmax, such as this paper **_Efficient Softmax Approximation for GPUs_**,...

the current version of ``--cuda`` is store_false, maybe it would be better to change it

Hi! Thanks for sharing this code base! Do you have a pre-trained model that we could use? We want to test the idea of incorporating a language model into our...

I am running this code on my Linux server, however it seems that this python program is only runned by one of my computer's cpu and leaves two thirds total...