YUAN, Zhihao
YUAN, Zhihao
Hi @sH1cHEnG , You can refer to https://github.com/MILVLG/mcan-vqa for the implementaion. We may release the code in the future. Best, Zhihao
Hi @ayushjain1144 , the Acc@mIoU is only used in ScanRefer. For referit3d, we use GT instance as input and evaluate the grounding accuracy solely. Please refer to their paper for...
Same question here. Do you have the plan to support directly inverse conv3d?
Yes, the checkpoint can get satisfactory results. As I said above, I can get the detection result mAP@50 around 50 with the given checkpoint.
Any progress?
Delete `[:self.max_des_len]` here. https://github.com/daveredrum/D3Net/blob/b505e984cc4b01ea6ed95aa94b7bafa45215f4f4/lib/dataset/pipeline.py#L453
Same problem here.
X-Trans2Cap model use 2D information in training stage only.
You can use the teacher model in inference. Uncomment below and change input feats accordingly. https://github.com/CurryYuan/X-Trans2Cap/blob/c78a27209f14fcbbec74fe8b5edc06faea2e7d44/models/xtrans.py#L302
Please save the nr3d.csv to json file similar to scanrefer.