Abhi Agg

Results 2 comments of Abhi Agg

@kimiyoung @zihangdai Following up, It seems that by default, the zeroing of upper triangular matrix is False. https://github.com/kimiyoung/transformer-xl/blob/master/pytorch/mem_transformer.py#L194 What is the reason for that?