Abhi Agg
@kimiyoung @zihangdai Following up: it seems that, by default, zeroing of the upper triangular part of the matrix is disabled (the flag is False). https://github.com/kimiyoung/transformer-xl/blob/master/pytorch/mem_transformer.py#L194 What is the reason for that?
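To make the question concrete, here is a minimal pure-Python sketch of a pad-and-reshape relative shift on a small score matrix, with and without zeroing the upper triangle. It is an illustration only, not the repository's code: the function name `rel_shift`, the flag name `zero_triu`, and the toy `scores` matrix are assumptions chosen for this example.

```python
def rel_shift(x, zero_triu=False):
    """Sketch of a pad-and-reshape relative shift on a qlen x klen list of lists.

    The names and the list-based layout are assumptions for illustration.
    """
    qlen, klen = len(x), len(x[0])
    # Prepend a zero column to every row, then flatten row by row.
    flat = [v for row in x for v in [0] + list(row)]
    # Drop the first qlen entries and re-read the rest as qlen rows of klen values.
    flat = flat[qlen:]
    out = [flat[i * klen:(i + 1) * klen] for i in range(qlen)]
    if zero_triu:
        # Keep column j of row i only when j <= i + (klen - qlen); zero the rest.
        out = [[v if j <= i + (klen - qlen) else 0 for j, v in enumerate(row)]
               for i, row in enumerate(out)]
    return out


scores = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
shifted = rel_shift(scores)                       # upper triangle holds leftovers
shifted_zeroed = rel_shift(scores, zero_triu=True)  # upper triangle cleared
print(shifted)
print(shifted_zeroed)
```

In this toy case the unzeroed result carries a value from the next row (the 4 in the first row) above the diagonal, and the zeroed result clears it. Whether that leftover matters would depend on whether a later attention mask already covers those positions, which is what the question above is asking about.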