Adji Bousso Dieng

Results 1 issues of Adji Bousso Dieng

Hi, Your code for adaptive softmax in splitcross.py is false. (1) For the tail, you are taking log_softmax twice and you are accounting for p(tombstone) twice. You are getting the...