Matias Häggman

3 comments by Matias Häggman

I am also having this issue, with both the sLSTM and the mLSTM. After some quick debugging, I found the 'gates' tensor to be filled with NaNs in the forward pass...
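For anyone trying to reproduce this kind of debugging, here is a minimal sketch of one way to find where NaNs first show up in a forward pass, using PyTorch forward hooks. It is not taken from the xLSTM code: `attach_nan_hooks` and `NanLayer` are hypothetical names made up for illustration, and the toy model only stands in for a real sLSTM/mLSTM stack.

```python
import torch
import torch.nn as nn


def attach_nan_hooks(model: nn.Module) -> list:
    """Register forward hooks that record each submodule whose output contains NaN.

    Hypothetical helper for illustration; returns a list that fills up
    with module names as forward passes run.
    """
    offenders = []  # module names, in the order their forward pass finished

    def make_hook(name):
        def hook(module, inputs, output):
            # A module may return a single tensor or a tuple/list of tensors.
            outputs = output if isinstance(output, (tuple, list)) else (output,)
            for out in outputs:
                if torch.is_tensor(out) and torch.isnan(out).any():
                    offenders.append(name)
                    break
        return hook

    for name, module in model.named_modules():
        if name:  # skip the root module, whose name is the empty string
            module.register_forward_hook(make_hook(name))
    return offenders


class NanLayer(nn.Module):
    """Toy stand-in for a misbehaving block: adding NaN poisons every element."""

    def forward(self, x):
        return x + float("nan")


# Toy stack (an assumption, not the xLSTM architecture).
model = nn.Sequential(nn.Linear(4, 4), NanLayer(), nn.Linear(4, 2))
offenders = attach_nan_hooks(model)
model(torch.randn(3, 4))
print(offenders)  # the first entry names the earliest module that produced NaN
```

Note that hooks only see module outputs. An intermediate tensor such as `gates` inside a module's own `forward` would still need a manual check like `torch.isnan(gates).any()` at that point in the code.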

> We should be able to construct a combined architecture using both sLSTM and mLSTM blocks; in fact, that is exactly how the original research paper suggests we use xLSTM. >...

I have this problem as well. It only occurs if you include the sLSTM module in the xLSTM stack; using only mLSTM works. I tested this on the Lightning platform with...