Mohammad Reza Samsami

Results 1 comments of Mohammad Reza Samsami

Based on your comments, I trained S4 models with more layers and a larger `d_model` and `d_state`. I used _convolution mode_ to update the parameters and _recurrent mode_ to generate...