zhangxi1997
Hello, thanks for your work! Could you provide the model that was pre-trained only on the VG dataset? Thanks a lot, and looking forward to your reply!
Hi, thanks for sharing. I see that the STVQA method needs visual features with the shape [batch_size, fnum, feat_dim, w, h]. How can I get this kind of...
Hi, thanks for sharing. Could you provide the pre-trained BERT features of the candidate answers? Thanks a lot!
Thanks for your interesting work! When I try to download the UPMC Food-101 dataset, the link https://visiir.isir.upmc.fr/explore fails with a 502 Bad Gateway error. I have downloaded the images of UPMC Food-101 from...
Thanks for your code! How can I obtain the extracted bounding-box region features for NExT-QA? Looking forward to your reply.
**Describe the bug** Looking forward to the QuickStart!