rr191211

Results 3 issues of rr191211

How to handle multiple images with Blip2 models? I have a large number of questions which require more than one image to answer for VQA task, like 1 questions vs...

Thanks for your awesome work in UDOP, I was trying out Finetune on VQA. Do you plan to release the code to pre-train such a model? Thank you!

Dear authors, Thank you for such seminal work! I am wondering about the training time and the required hardware resource. If I could be wrong, I didn't find related statements...