kpmokpmo

Hi, it seems that Linear SRA performs better while using fewer parameters on PVT-V2-B2. Could you please share more results from applying this attention to the other model variants?