Mohammad Reza Samsami
Based on your comments, I trained S4 models with more layers and larger `d_model` and `d_state`. I used _convolution mode_ to update the parameters and _recurrent mode_ to generate...
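
For readers unfamiliar with the two modes mentioned above, here is a minimal NumPy sketch of why they can be mixed: a discrete linear state space model gives the same outputs whether it is unrolled as one long convolution (convenient for training on whole sequences) or stepped as a recurrence (convenient for generation). This is not the code used for these experiments and it is not the actual S4 parameterization; the random matrices `A`, `B`, `C`, the helper names, and the sizes are all assumptions chosen only for illustration.

```python
import numpy as np

def ssm_kernel(A, B, C, length):
    """Build the convolution kernel K[k] = C @ A^k @ B for k = 0..length-1."""
    kernel = np.empty(length)
    x = B.copy()
    for k in range(length):
        kernel[k] = C @ x
        x = A @ x
    return kernel

def run_convolution(A, B, C, u):
    """Convolution mode: compute the whole output sequence at once."""
    K = ssm_kernel(A, B, C, len(u))
    # Full convolution has length 2L-1; the first L entries are the causal part.
    return np.convolve(u, K)[: len(u)]

def run_recurrence(A, B, C, u):
    """Recurrent mode: x_k = A x_{k-1} + B u_k, y_k = C x_k, one step at a time."""
    x = np.zeros(A.shape[0])
    ys = []
    for u_k in u:
        x = A @ x + B * u_k
        ys.append(C @ x)
    return np.array(ys)

# Hypothetical toy sizes and random parameters (assumptions, not the real setup).
rng = np.random.default_rng(0)
d_state = 4
A = 0.5 * rng.standard_normal((d_state, d_state)) / np.sqrt(d_state)
B = rng.standard_normal(d_state)
C = rng.standard_normal(d_state)
u = rng.standard_normal(16)

y_conv = run_convolution(A, B, C, u)
y_rec = run_recurrence(A, B, C, u)
print(np.allclose(y_conv, y_rec))
```

In a real S4 layer the kernel is computed far more efficiently and the matrices are structured and discretized, but the equivalence between the two views is the same idea.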