transformer-xl
transformer-xl copied to clipboard
why i-j always>0
is it some mask mechanism as in transformer decoder