[WIP][Bug fix] use pad_shape instead of img_shape for bevformer and recover the origin code where the first frame pre_bev is None
-
Use
pad_shapeinstead ofimg_shapefor bevformer, whereimg_shapereserves the origin image shape andpad_shapeis the image shape after padded. -
Recover the origin codes where the first frame
pre_bevis None. Present codes set the first framepre_bevto zeros during the trainng phase in order to keep consistent with the deploy phase, but it seems to be not too right. The reasons are two below:
- Temporal information cannot be closed flexiblely where the
pre_bevshould be set as None. - When temporal information is needed,
pre_bevis the key and value of the temporal encoder layers. Ifpre_bevis None, the encoder layer will set the value ofpre_bevthe same as query.
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
luoqianhui seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.
This PR conflicts with the latest code, please fix it