About BEVFormer-S

Open LuckySunnye opened this issue 3 years ago • 11 comments
Thanks for your great work! I want to train BEVFormer without temporal information. Can you tell me how to adjust config.py and the code to reproduce the BEVFormer-S results from your paper? [image]

LuckySunnye avatar Jul 02 '22 00:07 LuckySunnye

I have the same question and hope you can provide a tutorial on training BEVFormer-S. Thanks.

Bosszhe avatar Jul 02 '22 07:07 Bosszhe

I also have the same question. Could we get a tutorial on reproducing BEVFormer-S?

yeeoon avatar Jul 04 '22 01:07 yeeoon

Hi! Thanks for your great work. @zhiqi-li I also want to reproduce the BEVFormer-S results, but I got the following result, which is lower than the one reported in the paper.

[Screenshot, 2022-07-04 7:12 PM]

I modified the forward_train function in detectors/bevformer.py: [image]

czy341181 avatar Jul 04 '22 11:07 czy341181

> I modified the forward_train function in detectors/bevformer.py:

For BEVFormer-S, you should set video_test_mode=False in your config.

zhiqi-li avatar Jul 04 '22 14:07 zhiqi-li

So, was the BEVFormer-S result in the original paper obtained by training with TemporalSelfAttention and testing with a degraded SelfAttention (video_test_mode=False in the config)?

Roserland avatar Jul 05 '22 04:07 Roserland

> So, was the BEVFormer-S result in the original paper obtained by training with TemporalSelfAttention and testing with a degraded SelfAttention (video_test_mode=False in the config)?

No. BEVFormer-S was trained with only one sample, without temporal information; in this case, TemporalSelfAttention is just a plain self-attention. So for BEVFormer-S, you should set video_test_mode=False when testing.

zhiqi-li avatar Jul 05 '22 04:07 zhiqi-li

Thanks for your timely reply. I got the results reported in the paper.

czy341181 avatar Jul 05 '22 05:07 czy341181

> > So, was the BEVFormer-S result in the original paper obtained by training with TemporalSelfAttention and testing with a degraded SelfAttention (video_test_mode=False in the config)?
>
> No. BEVFormer-S was trained with only one sample, without temporal information; in this case, TemporalSelfAttention is just a plain self-attention. So for BEVFormer-S, you should set video_test_mode=False when testing.

Thank you! And excellent work!

Roserland avatar Jul 05 '22 05:07 Roserland

> Thanks for your timely reply. I got the results reported in the paper.

Hi, could you kindly explain how you got the paper's results? Did you train with video_test_mode=False and len_queue=1 and test with video_test_mode=False, or did you use some other setting?

JudasDie avatar Sep 19 '22 09:09 JudasDie

@czy341181 Did you train BEVFormer-S from scratch and get the results reported in the paper?

Daniel-xsy avatar Mar 06 '23 10:03 Daniel-xsy

> > So, was the BEVFormer-S result in the original paper obtained by training with TemporalSelfAttention and testing with a degraded SelfAttention (video_test_mode=False in the config)?
>
> No. BEVFormer-S was trained with only one sample, without temporal information; in this case, TemporalSelfAttention is just a plain self-attention. So for BEVFormer-S, you should set video_test_mode=False when testing.

Hi, do you mean setting the data queue length to 1 to train the static version while keeping all other settings the same as in the temporal version?

wenyuqing avatar Mar 08 '24 04:03 wenyuqing
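
For readers skimming the thread, the settings discussed above can be collected into a small sketch. Only `video_test_mode=False` is confirmed by the maintainer here; the sketch assumes the config exposes the data queue length as a top-level `queue_length` variable (the thread also calls it `len_queue`) and that `video_test_mode` lives in the `model` dict. `make_static_cfg` is a hypothetical helper, not part of the BEVFormer repo, and whether a queue length of 1 alone reproduces the paper's numbers is not confirmed in this thread.

```python
import copy

# Stand-in for the relevant parts of a temporal BEVFormer config.
# ASSUMPTION: the key names and the value 4 are illustrative; check your own config file.
temporal_cfg = dict(
    queue_length=4,  # assumed: number of frames per training sample
    model=dict(type="BEVFormer", video_test_mode=True),
)


def make_static_cfg(cfg):
    """Return a copy of cfg with temporal input disabled (BEVFormer-S style)."""
    static = copy.deepcopy(cfg)
    # Train on a single sample, i.e. no history frames (assumed reading of the reply).
    static["queue_length"] = 1
    # Setting named by the maintainer for BEVFormer-S testing.
    static["model"]["video_test_mode"] = False
    return static


static_cfg = make_static_cfg(temporal_cfg)
print(static_cfg)
```

In practice the same two values would be edited directly in the config file; the helper only makes the difference from the temporal setup explicit.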