External-Attention-pytorch
🍀 PyTorch implementations of various Attention Mechanisms, MLPs, Re-parameterization and Convolution modules, which are helpful for further understanding the papers. ⭐⭐⭐
add shuffletransformer module
```python
if __name__ == '__main__':
    input = torch.randn(3, 256, 7, 7)
    danet = DAModule(d_model=256, kernel_size=3, H=7, W=7)
    print(danet(input).shape)
```
When I change the input to (3, 256, 128, 128), it fails. Why?
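A minimal sketch of what likely needs to change, assuming (as the constructor signature suggests) that `DAModule` builds its position attention over `H*W` tokens, so the `H` and `W` arguments must match the input's spatial size; the import path is assumed from this repo's layout:

```python
import torch
from model.attention.DANet import DAModule  # import path assumed from this repo's layout

if __name__ == '__main__':
    # Assumption: H and W passed to the constructor must equal the input's
    # spatial dimensions, so a (3, 256, 128, 128) input needs H=128, W=128.
    input = torch.randn(3, 256, 128, 128)
    danet = DAModule(d_model=256, kernel_size=3, H=128, W=128)
    print(danet(input).shape)  # expected: torch.Size([3, 256, 128, 128])
```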
I think there is a problem with the CondConv implementation: it is almost the same as the DynamicConv one and differs from the original paper's architecture (according to the paper, it should...
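For reference, here is a minimal sketch of the per-example routing described in the CondConv paper (this is not this repo's code, and the class name is made up): each sample mixes K expert kernels with sigmoid routing weights into its own kernel, which is then applied via a grouped convolution.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CondConv2dSketch(nn.Module):
    """Sketch of CondConv-style per-example kernel routing (paper description, not this repo)."""
    def __init__(self, in_ch, out_ch, kernel_size, num_experts=4):
        super().__init__()
        self.num_experts = num_experts
        self.out_ch = out_ch
        self.kernel_size = kernel_size
        self.routing = nn.Linear(in_ch, num_experts)
        # K expert kernels, each of shape (out_ch, in_ch, k, k)
        self.weight = nn.Parameter(
            torch.randn(num_experts, out_ch, in_ch, kernel_size, kernel_size))

    def forward(self, x):
        B, C, H, W = x.shape
        # Per-example routing weights from global average pooling
        r = torch.sigmoid(self.routing(x.mean(dim=(2, 3))))         # (B, K)
        # Mix the expert kernels into one kernel per sample
        w = torch.einsum('bk,koihw->boihw', r, self.weight)         # (B, O, C, k, k)
        # Apply each sample's own kernel via a grouped-convolution trick
        x = x.reshape(1, B * C, H, W)
        w = w.reshape(B * self.out_ch, C, self.kernel_size, self.kernel_size)
        out = F.conv2d(x, w, padding=self.kernel_size // 2, groups=B)
        return out.reshape(B, self.out_ch, H, W)
```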
WeightedPermuteMLP uses several fully-connected (Linear) layers; see lines 21-23 of ViP.py:
```python
self.mlp_c = nn.Linear(dim, dim, bias=qkv_bias)
self.mlp_h = nn.Linear(dim, dim, bias=qkv_bias)
self.mlp_w = nn.Linear(dim, dim, bias=qkv_bias)
```
All three linear layers have `dim` input and output features, i.e. the number of channels is unchanged. In `forward`, `mlp_c` is applied directly to `x`, which is fine:
```python
def forward(self, x):
    B, H, W, C = x.shape
    # channel branch: applied directly on x
    c_embed = self.mlp_c(x)

    S = C // self.seg_dim
    # height branch: permute/reshape so the last dimension becomes H*S
    h_embed = x.reshape(B, H, W, self.seg_dim, S).permute(0, 3, 2, 1, 4).reshape(B, self.seg_dim, W, H * S)
    h_embed = self.mlp_h(h_embed).reshape(B, self.seg_dim, W, H, S).permute(0, 3, 2, 1, 4).reshape(B, H, W, C)

    # width branch: permute/reshape so the last dimension becomes W*S
    w_embed = x.reshape(B, H, W, self.seg_dim, S).permute(0, 3, 1, 2, 4).reshape(B, self.seg_dim, H, W * S)
    w_embed = self.mlp_w(w_embed).reshape(B, self.seg_dim, H, W, S).permute(0, 2, 3, 1, 4).reshape(B, H, W, C)

    # reweighting of the three branches
    weight = (c_embed + h_embed + w_embed).permute(0, 3, 1, 2).flatten(2).mean(2)
    weight = self.reweighting(weight).reshape(B, C, 3).permute(2, 0, 1).softmax(0).unsqueeze(2).unsqueeze(2)

    x = c_embed * weight[0] + w_embed * weight[1] + h_embed * weight[2]
    x = self.proj_drop(self.proj(x))
    return x
```
However, the other two linear layers are problematic in how they are used. Look at this step:
```python
h_embed = x.reshape(B, H, W, self.seg_dim, S).permute(0, 3, 2, 1, 4).reshape(B, self.seg_dim, W, H * S)
```
...
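A quick shape check with hypothetical values (B, H, W, C and seg_dim are chosen arbitrarily here) illustrating the concern: after the permute/reshape, the last dimension fed to `self.mlp_h` is `H * S`, so `nn.Linear(dim, dim)` only matches when `H * (C // seg_dim)` happens to equal `dim`.

```python
import torch

B, H, W, C, seg_dim = 2, 14, 14, 256, 8
S = C // seg_dim
x = torch.randn(B, H, W, C)

# Same permute/reshape as the height branch above
h_in = x.reshape(B, H, W, seg_dim, S).permute(0, 3, 2, 1, 4).reshape(B, seg_dim, W, H * S)
print(h_in.shape)  # torch.Size([2, 8, 14, 448]) -> last dim is H*S, not C
```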
Hello, I think Criss-Cross Attention and Axial Attention are also commonly used attention mechanisms.
The link seems wrong; it is the same as the one for 'Coordinate Attention for Efficient Mobile Network Design'.
https://github.com/xmu-xiaoma666/External-Attention-pytorch/blob/2f80b03ef1cdd835d4a2d21eff6f8b3534e5d601/model/attention/CoAtNet.py#L21 Correct me if I am wrong, but isn't an MLP usually a collection of fully-connected layers rather than convolution layers?
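For context, a 1x1 `Conv2d` applied position-wise computes the same thing as a `Linear` layer over the channel dimension, which is why some implementations write MLP blocks with convolutions. A small sketch of the equivalence (not the repo's code, and whether CoAtNet.py actually uses kernel_size=1 at that line is not verified here):

```python
import torch
import torch.nn as nn

lin = nn.Linear(64, 128)
conv = nn.Conv2d(64, 128, kernel_size=1)
with torch.no_grad():
    # Copy the Linear weights into the 1x1 convolution
    conv.weight.copy_(lin.weight.reshape(128, 64, 1, 1))
    conv.bias.copy_(lin.bias)

x = torch.randn(2, 64, 8, 8)
out_conv = conv(x)
out_lin = lin(x.permute(0, 2, 3, 1)).permute(0, 3, 1, 2)
print(torch.allclose(out_conv, out_lin, atol=1e-6))  # True
```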
Paper: Fully Attentional Network for Semantic Segmentation