kpmokpmo
Hi, it seems Linear SRA works better with fewer parameters on PVT-V2-B2. Could you please share more results from applying this attention to the other model variants?