mamba icon indicating copy to clipboard operation
mamba copied to clipboard

Some questions about the shape of A,B,C,D

Open Aristo23333 opened this issue 6 months ago • 8 comments

Dear Authors, Thanks for your brilliant works! Now I am learning about the detailed change of parameter shape in your code and in your paper. I noticed that the A is (D,N), which represents a NN matrix with diagonal structure, right? It seems to represent there are D NN matrixes for D channels? The same situation for D. But I get some confused about the B, C which is (B,L,N) because they seems not to contain the information of different channls. Are they all the same for D channels? Thank you!

Aristo23333 avatar Aug 13 '24 07:08 Aristo23333