ViTAE-VSA icon indicating copy to clipboard operation
ViTAE-VSA copied to clipboard

The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"

Results 8 ViTAE-VSA issues
Sort by recently updated
recently updated
newest added

In your code file `ViTAE-VSA\Image-Classification\vitaev2_vsa\NormalCell.py` L130: `self.pos = nn.Conv2d(dim, dim, window_size//2*2+1, 1, window_size//2, groups=dim, bias=True)` your `window_size`is 7,so the `self.pos` convolution kernel is 7 too, in most Positional Encoding extractor...

语义分割和 Swin+VSA 什么时候发布呀

hello, i am now doing a work about semantic segmentation and i would like to use swin with vsa model as the feature extractor backbone. I would like to ask,...

Hello, i would like to ask about the setting of img_size=(1,1) in line 36 vsa.py. Which size does it mean? thanks a lot.

I would like to ask how to use the provided weights pre-training file, .tar type zipped file extracted without a .pth file, but a folder

Hello! I hope to have the model that just adding vsa to swin transformer like said in the paper. Can you release the code? Thank you very much.