mamba
mamba copied to clipboard
Parameter number problem of Mamba2
I did not do bidirectional processing inside mamba2 (the same as Vision Mamba), I did a bidirectional work outside of the class, but the whole model has a large number of parameters, I would like to ask how to solve this, anyone has done bidirectional work inside Mamba2 for reference.