qiang chen comments

Results 37 comments of


                                            qiang chen

[WIP] Release code of MixFormer (CVPR2022, Oral)

@Seperendity 你好，这个是可以通过[广播机制](https://www.paddlepaddle.org.cn/documentation/docs/zh/guides/beginner/tensor_cn.html#id7)来实现的

[WIP] Release code of MixFormer (CVPR2022, Oral)

@Seperendity 你好，这样做是为了配合v的维度。举个例子理解一下，假设v的shape是`[B, C, H, W]`，x_cnn2v的shape是`[B, C, 1, 1]`，那么`v = v * x_cnn2v`是一个简单的channel attention。但是，在代码里的第223行，因为后续要准备做window-based self-attention，v的shape是`[B*(H/win)*(W/win), win*win, num_heads, C/num_heads]`，而x_cnn2v的shape是`[B, C, 1, 1]`，这个时候没法直接做channel attention。当然这里可以用不同的实现： 1. 你可以把v再reshape回`[B, C, H, W]`，做完channel attention之后，再变成`[B*(H/win)*(W/win), win*win, num_heads, C/num_heads]`，再进入到下面的self-attention。...

qiang chen

[WIP] Release code of MixFormer (CVPR2022, Oral)

[WIP] Release code of MixFormer (CVPR2022, Oral)

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?

Hi, could you provide a detailed log of the multi-scale training(R50_C5)?