Qibin Hou

40 comments by Qibin Hou

Thanks for your interest in this paper. I have tried spnet50 on Cityscapes and the mAP score is 79.5 on the validation set (w/o hard example mining). Please check your...

Hi, the local context aggregation is conducted in the value projection operation. The attention weights are generated by a linear layer, while an unfold operation is applied to the value tensor....
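For reference, a minimal single-head sketch of this idea in PyTorch: the attention weights come directly from a linear layer (no query-key dot product), and `F.unfold` gathers each token's local neighborhood from the value tensor. The kernel size, the single weight per neighbor, and the absence of multi-head handling and the fold step are simplifying assumptions; this is not the official VOLO implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleOutlook(nn.Module):
    """Simplified, single-head sketch of the outlook-attention idea.

    - attention weights are predicted directly by a linear layer;
    - local context comes from unfolding the value tensor so every
      token aggregates its k x k neighborhood.
    """

    def __init__(self, dim, kernel_size=3):
        super().__init__()
        self.k = kernel_size
        self.v = nn.Linear(dim, dim)                    # value projection
        self.attn = nn.Linear(dim, kernel_size ** 2)    # one weight per neighbor (simplified)
        self.proj = nn.Linear(dim, dim)

    def forward(self, x):
        # x: (B, H, W, C) token map
        B, H, W, C = x.shape
        k = self.k

        # Project values, then unfold so each location holds its k*k neighbors.
        v = self.v(x).permute(0, 3, 1, 2)                          # (B, C, H, W)
        v = F.unfold(v, kernel_size=k, padding=k // 2)             # (B, C*k*k, H*W)
        v = v.reshape(B, C, k * k, H * W).permute(0, 3, 2, 1)      # (B, H*W, k*k, C)

        # Attention weights come from a plain linear layer on each token.
        a = self.attn(x).reshape(B, H * W, k * k).softmax(dim=-1)  # (B, H*W, k*k)

        # Weighted aggregation of the local neighborhood.
        out = (a.unsqueeze(-1) * v).sum(dim=2)                     # (B, H*W, C)
        return self.proj(out).reshape(B, H, W, C)


# quick shape check
x = torch.randn(2, 14, 14, 64)
print(SimpleOutlook(64)(x).shape)  # torch.Size([2, 14, 14, 64])
```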

> Hi, thank you for your paper and congrats on SOTA.
>
> I have a question related to this: from the linear projection we generate an attention map for...

Thanks for your question. The difference is clear. The outlooker in VOLO is a new attention mechanism that targets encoding fine-level token representations. We use a linear layer to generate...

Hi, thanks for sharing your log with us. Did you train any other models on your dataset, and do they behave normally?

Have you checked that there is no problem with your val set?

It seems that there are some problems with the training data.

Thanks for your comments. This is really important. As you suggested, I will release the training log soon.

I do use it, but it depends on your hardware. Sometimes, apex amp works better in terms of training efficiency.
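A minimal sketch contrasting the two mixed-precision options mentioned above, NVIDIA apex amp and PyTorch's native amp. The toy model, optimizer, and loss are placeholders for illustration only; apex additionally requires https://github.com/NVIDIA/apex to be installed and a CUDA device.

```python
import torch
import torch.nn as nn

device = "cuda"
model = nn.Linear(16, 4).to(device)           # toy model as a stand-in
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()
inputs = torch.randn(8, 16, device=device)
targets = torch.randint(0, 4, (8,), device=device)

USE_APEX = False  # flip depending on which amp backend you prefer

if USE_APEX:
    # NVIDIA apex amp: wrap model/optimizer once, then scale the loss.
    from apex import amp
    model, optimizer = amp.initialize(model, optimizer, opt_level="O1")
    loss = criterion(model(inputs), targets)
    with amp.scale_loss(loss, optimizer) as scaled_loss:
        scaled_loss.backward()
    optimizer.step()
else:
    # Native PyTorch amp: autocast the forward pass, scale gradients.
    scaler = torch.cuda.amp.GradScaler()
    optimizer.zero_grad()
    with torch.cuda.amp.autocast():
        loss = criterion(model(inputs), targets)
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
```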

Two 1x1 convs are used to build the inter-channel relationship.
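One common way such a pair of 1x1 convolutions is arranged is a bottleneck with a nonlinearity in between, used to model inter-channel dependencies. The reduction ratio, the ReLU, and the sigmoid gating below are assumptions for illustration; the module referred to in the comment may differ.

```python
import torch
import torch.nn as nn

class ChannelMix(nn.Module):
    """Two 1x1 convolutions modelling inter-channel relationships.

    The bottleneck ratio and the sigmoid gating are illustrative
    assumptions, not necessarily the exact module discussed above.
    """

    def __init__(self, channels, reduction=4):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels // reduction, kernel_size=1)
        self.act = nn.ReLU(inplace=True)
        self.conv2 = nn.Conv2d(channels // reduction, channels, kernel_size=1)

    def forward(self, x):
        # 1x1 convs mix information across channels only; spatial size is unchanged.
        w = self.conv2(self.act(self.conv1(x)))
        return x * torch.sigmoid(w)   # gate the input with the channel response


x = torch.randn(2, 64, 32, 32)
print(ChannelMix(64)(x).shape)  # torch.Size([2, 64, 32, 32])
```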