mamba
Some questions about the shapes of A, B, C, D
Dear authors, thanks for your brilliant work! I am studying how the parameter shapes change in your code and in your paper. I noticed that A has shape (D, N), which represents an N×N matrix with diagonal structure, right? Does this mean there are D separate N×N matrices, one for each of the D channels? I assume the same applies to D. However, I am confused about B and C, which have shape (B, L, N), because they do not seem to contain any per-channel information. Are they the same for all D channels? Thank you!
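
To make the shapes in question concrete, here is a minimal NumPy sketch. It is only an illustration under stated assumptions, not the repository's implementation: the sizes, the names `delta`, `D_skip`, and `A_full`, and the plain Python loop are all hypothetical, and a per-channel step size `delta` of shape (B, L, D) is assumed, following the paper's description of the selective SSM. Under this reading, B and C are shared by all D channels and broadcast along the channel axis, while A and `delta` differ per channel.

```python
import numpy as np

# Hypothetical sizes: batch, sequence length, channels (D), state size (N).
batch, L, D, N = 2, 5, 4, 3
rng = np.random.default_rng(0)

A = -np.exp(rng.standard_normal((D, N)))            # (D, N): row d holds the diagonal of channel d's N x N matrix
B = rng.standard_normal((batch, L, N))              # (B, L, N): no channel axis
C = rng.standard_normal((batch, L, N))              # (B, L, N): no channel axis
D_skip = rng.standard_normal(D)                     # (D,): one skip weight per channel (the "D" parameter)
delta = np.exp(rng.standard_normal((batch, L, D)))  # (B, L, D): assumed per-channel step size
x = rng.standard_normal((batch, L, D))              # (B, L, D): input

# Explicit view of A as D diagonal N x N matrices (for illustration only).
A_full = np.stack([np.diag(A[d]) for d in range(D)])  # (D, N, N)

# Discretization: B is broadcast over the channel axis, so every channel
# sees the same B_t, but the result differs per channel through delta.
dA = np.exp(delta[..., None] * A)           # (B, L, D, N)
dB = delta[..., None] * B[:, :, None, :]    # (B, L, D, N)

h = np.zeros((batch, D, N))                 # one N-dimensional state per channel
ys = []
for t in range(L):
    # A diagonal matrix times a vector is an elementwise product.
    h = dA[:, t] * h + dB[:, t] * x[:, t, :, None]
    # C_t is likewise shared by all channels and broadcast over D.
    y_t = (h * C[:, t, None, :]).sum(-1) + D_skip * x[:, t]
    ys.append(y_t)
y = np.stack(ys, axis=1)                    # (B, L, D)
```

In this sketch each channel keeps its own N-dimensional state, so storing A as (D, N) is equivalent to storing D diagonal N×N matrices.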