xuqiang
Results
1
issues of
xuqiang
1. The initialization parameters for `DotProductAttention` and `TEDotProductAttention` are different. Using DotProductAttention to construct MultiLatentAttention will result in an error. 2. In the `MLASelfAttention` module, the dimensions of `k_pos_emb` and...