mcr2
mcr2 copied to clipboard
After several rounds, the loss function will become nan.
logdet value becomes nan, when I used the UCR dataset named FaceDetecion. I chosed the BERT model to deal with the multiple timeseries data. I wonder why this happened?