awd-lstm-lm
awd-lstm-lm copied to clipboard
Unpredictable behavior of adaptive softmax
The behavior of adaptive softmax is very unpredictable. Sometimes I can run through the whole code on dataset A at the first time, but got error message when training on dataset B with same format and schema. Then, if I switch back to dataset A, the code failed again. Here is the error message:
Traceback (most recent call last):
File "main.py", line 244, in
This issue has blocked me for a long time. Please review it, thanks!
PS: after getting the error first time, I'm not able to run through any data except dataset with vocabulary < 75000, which result in regular softmax. Thus, something should be fixed in splitcross.py.
Can you try on the commit https://github.com/salesforce/awd-lstm-lm/tree/bf0742cab41d8bf4cd817acfe7e5e0cbff4131ba ? If that works, I can help you with getting the improvements from that commit for low-vocabulary datasets.
I tried this and it seems to work. I am trying to use my own dataset (cantab-tedlium) to train using adaptive-softmax and it crashes often.