SeaFormer
Hi, I noticed that in Table 4 you compare different self-attention modules on the ADE20K val set, based on the Swin Transformer architecture.
When you swap in a new attention module, do you pretrain on ImageNet and then fine-tune on ADE20K?