Xuanlong Yu
Xuanlong Yu
Hi, thank you for leaving the comments. Actually, it appears in the original bts training as well, you can check [here](https://github.com/cleinc/bts/issues/36) to find some suggestions given by the authors of...
Several common loss functions are indeed non-negative such as MSELoss or L1Loss. But it is fine to have a negative loss value during training, not only in LDU case. You...
Thank you for your comment. torch.cdist(a,a) calculates distance between _each_ pair of the two collections of row vectors. So it will output 0 on the diagonal of the output matrix,...