xuqiang

Results: 1 issue by xuqiang

1. The initialization parameters for `DotProductAttention` and `TEDotProductAttention` are different, so using `DotProductAttention` to construct `MultiLatentAttention` results in an error.
2. In the `MLASelfAttention` module, the dimensions of `k_pos_emb` and...
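
The first point, a constructor mismatch between two attention backends that are meant to be interchangeable, can be checked with a small sketch like the one below. The classes `BackendA` and `BackendB` and the parameter `head_dim_override` are hypothetical stand-ins invented for illustration; they are not the real signatures of `DotProductAttention` or `TEDotProductAttention`, which the issue summary does not show.

```python
import inspect


class BackendA:
    """Hypothetical stand-in for an attention backend with a short constructor."""

    def __init__(self, config, layer_number):
        self.config = config
        self.layer_number = layer_number


class BackendB:
    """Hypothetical stand-in for a backend that accepts one extra argument."""

    def __init__(self, config, layer_number, head_dim_override=None):
        self.config = config
        self.layer_number = layer_number
        self.head_dim_override = head_dim_override


def init_param_diff(cls_a, cls_b):
    """Return (names only in cls_a.__init__, names only in cls_b.__init__)."""
    params_a = set(inspect.signature(cls_a.__init__).parameters) - {"self"}
    params_b = set(inspect.signature(cls_b.__init__).parameters) - {"self"}
    return sorted(params_a - params_b), sorted(params_b - params_a)


only_a, only_b = init_param_diff(BackendA, BackendB)
print("only in BackendA:", only_a)
print("only in BackendB:", only_b)

# A caller written against BackendB fails when handed BackendA instead.
try:
    BackendA(config={}, layer_number=1, head_dim_override=64)
    swap_error = None
except TypeError as exc:
    swap_error = exc
print("swap error:", swap_error)
```

Running the same comparison against the two real classes would list exactly which arguments the calling module passes that one backend does not accept.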