Adji Bousso Dieng
Results
1
issues of
Adji Bousso Dieng
Hi, Your code for adaptive softmax in splitcross.py is false. (1) For the tail, you are taking log_softmax twice and you are accounting for p(tombstone) twice. You are getting the...