Paul Michel
Paul Michel
Hi @bing0037 I haven't run this code in a while but it used to work. My first guess would be probably an incompatibility with a newer version of pytorch. Can...
After investigating a bit on this I can make an educated guess on the reason: There is a possibility that the attention weights returned by the transformerDecoderLayer are not masked...
It should be fixed by #226 which hasn't been merged yet
Hi Shuang, thanks for bringing this to my attention. I fixed the link in the README, you should now be able to access the dataset page at https://pmichel31415.github.io/sated/ and the...
Sorry for the late reply and thanks for bringing this up! What is the error message exactly when the program ends? Does it nor find the data even though the...
Got it. I will keep this issue open then, so people who encounter the same issue can see the solution. Maybe if I have time later I will push a...
Hmm, could be an issue with cached files still containing the original trigger tokens... Which files contain the new trigger tokens vs the old? Can you try deleting the files...
I'm interested in helping for this, how would I go about doing it?
@msperber I have a question regarding serialization in general: say I so what you said (make DevLossTracker and TrainLossTracker Serializable), will this be backward compatible? As in will I be...
That's good to know