ViTAE-VSA
The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"
In your code file `ViTAE-VSA\Image-Classification\vitaev2_vsa\NormalCell.py`, L130: `self.pos = nn.Conv2d(dim, dim, window_size//2*2+1, 1, window_size//2, groups=dim, bias=True)`. Your `window_size` is 7, so the `self.pos` convolution kernel is 7 too. In most Positional Encoding extractor...
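To make the arithmetic in that question concrete, here is a minimal sketch of how the kernel size and padding follow from `window_size` in the quoted line. The helper name `pos_conv_geometry` and the input size of 56 are hypothetical, chosen only for illustration. The output-size formula is the standard one for a convolution with dilation 1, not something taken from the repository.

```python
def pos_conv_geometry(window_size: int, in_size: int, stride: int = 1):
    """Return (kernel, padding, out_size) for the quoted nn.Conv2d arguments.

    Hypothetical helper: it mirrors the expressions in the quoted line,
    kernel = window_size//2*2+1 and padding = window_size//2.
    """
    kernel = window_size // 2 * 2 + 1   # always odd: 7 -> 7, 8 -> 9
    padding = window_size // 2          # "same" padding for an odd kernel
    # Standard convolution output size, assuming dilation = 1.
    out_size = (in_size + 2 * padding - kernel) // stride + 1
    return kernel, padding, out_size


if __name__ == "__main__":
    # in_size=56 is an assumed feature-map side length, for illustration only.
    for ws in (3, 7, 8):
        print(ws, pos_conv_geometry(ws, in_size=56))
```

With `window_size = 7` the kernel is 7 and the padding is 3, so the spatial size is unchanged at stride 1. An even `window_size` is rounded up to the next odd kernel size.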
Swin+VSA
When will the semantic segmentation code and Swin+VSA be released?
@Qiming-Zhang1 @RogerZhangzz
Hello, I am currently working on semantic segmentation and would like to use the Swin with VSA model as the feature-extractor backbone. I would like to ask,...
Hello, I would like to ask about the setting `img_size=(1,1)` on line 36 of vsa.py. What size does it refer to? Thanks a lot.
I would like to ask how to use the provided pre-trained weights. Extracting the .tar archive yields a folder rather than a .pth file.
Hello! I would like to have the model that simply adds VSA to Swin Transformer, as described in the paper. Could you release the code? Thank you very much.