2DPASS
Modality fusion implementation question
Hello. I am studying 2DPASS through the code. The modality fusion seems to be implemented in network/arch_2dpass.py, lines 100 to 105. The paper specifies an element-wise add, but I do not see it in the code, so I am asking a question. Below is the code.
# modality fusion
feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight)
I think that fuse_feat = F.relu(feat_cat * feat_weight) + img_feat would implement the formula in the paper. Is that right?
I have the same question.
thanks in advance
Have you successfully reproduced the model on nuScenes? I ran several experiments, but the performance is far from the reported results. I also tested the provided weights on nuScenes and got results similar to the reported ones. I'd like to know if I forgot to set some arguments.
This issue is about the code implementation, not performance. I think the released code does not implement the fusion as described in the paper; the addition is missing:
# modality fusion
feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight)
I think that fuse_feat = F.relu(feat_cat * feat_weight) + img_feat implements the formula in the paper:
feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight) + img_feat
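For reference, the fusion path with the proposed residual add can be written as a standalone module. This is only a minimal sketch: the `leaner`/`fc1`/`fc2` names mirror the repo's `leaners`/`fcs1`/`fcs2`, but the feature dimensions here are placeholders, and the real code builds one such block per scale (`idx`).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FusionBlock(nn.Module):
    """Minimal sketch of the 2DPASS-style fusion with a residual add.

    Dimensions are placeholders; the repo constructs these layers per scale.
    """
    def __init__(self, img_dim=64, pts_dim=64):
        super().__init__()
        self.leaner = nn.Linear(pts_dim, img_dim)    # stand-in for self.leaners[idx]
        self.fc1 = nn.Linear(img_dim * 2, img_dim)   # stand-in for self.fcs1[idx]
        self.fc2 = nn.Linear(img_dim, img_dim)       # stand-in for self.fcs2[idx]

    def forward(self, img_feat, pts_feat):
        feat_learner = F.relu(self.leaner(pts_feat))
        feat_cat = torch.cat([img_feat, feat_learner], dim=1)
        feat_cat = self.fc1(feat_cat)
        feat_weight = torch.sigmoid(self.fc2(feat_cat))
        # residual add of img_feat, as suggested above
        return F.relu(feat_cat * feat_weight) + img_feat

fuse = FusionBlock()
out = fuse(torch.randn(8, 64), torch.randn(8, 64))
print(out.shape)  # torch.Size([8, 64])
```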
Thanks for your reply, I will further check this issue.
I just trained the modified version as you suggested, and the performance did improve slightly, by around 2 mIoU. I believe there may be other incorrect or missing implementations in the released code. Thank you again.
@kjwkch @LiXiang0021
After reviewing the current implementation, I noticed that besides the fusion modification, the point feature passed through the 2D learner also needs to be added to the original point feature before it goes through multihead_3d_classifier, in order to match the model architecture outlined in the paper, as below:
feat_learner = F.relu(self.leaners[idx](pts_feat))
# feat_learner -> voxel-wise feature after 2D learner
pts_pred_full = self.multihead_3d_classifier[idx]((pts_feat+feat_learner))
# pts_feat+feat_learner -> voxel-wise Enhanced 3D Features
# correspondence
pts_label_full = self.voxelize_labels(data_dict['labels'], data_dict['layer_{}'.format(idx)]['full_coors'])
pts_pred = self.p2img_mapping(pts_pred_full[coors_inv], point2img_index, batch_idx)
# modality fusion
feat_learner = self.p2img_mapping(feat_learner[coors_inv], point2img_index, batch_idx)
# feat_learner -> point-wise feature after 2D learner and img_mapping
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight) + img_feat
Currently, the implementation takes the point feature directly as input to multihead_3d_classifier, instead of adding the feature produced by the 2D learner first.
https://github.com/yanx27/2DPASS/blob/80b8646bbb0dd46ddcee31f531d6f1c45baf35fe/network/arch_2dpass.py#L89-L93
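To make the difference concrete, here is a toy sketch contrasting the two classifier inputs. The layer shapes and the `leaner`/`classifier` names are placeholders for illustration, not the repo's actual modules.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

pts_dim = 64
num_classes = 20
leaner = nn.Linear(pts_dim, pts_dim)          # stand-in for self.leaners[idx]
classifier = nn.Linear(pts_dim, num_classes)  # stand-in for multihead_3d_classifier[idx]

pts_feat = torch.randn(100, pts_dim)          # voxel-wise point features
feat_learner = F.relu(leaner(pts_feat))       # feature after the 2D learner

# current repo behaviour: classifier sees only the raw point feature
pred_current = classifier(pts_feat)

# proposed behaviour: classifier sees the enhanced 3D feature
pred_proposed = classifier(pts_feat + feat_learner)

print(pred_current.shape, pred_proposed.shape)
```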
@yanx27, I would appreciate any suggestions you may have regarding this matter.
@jaywu109 do the new script changes work for you?