VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Results 22 VL-T5 issues

Hi jeamin, Thanks for your interesting work. I have been working with your codebase for the REG (referring expression generation) task, and I modified the fine-tuning code to fit the REG task. And...

I'm running your VQA model on Google Colab and I get an error when loading the model weights: ``` Building Model at GPU 0 Some weights of VLT5VQA...

Hi, by running ```bash scripts/RefCOCOg_VLBart.sh 1``` I got the following error: ``` Original Traceback (most recent call last): File "/home/zuujhyt/miniconda3/envs/vlt5/lib/python3.7/site-packages/torch/utils/data/_utils/worker.py", line 185, in _worker_loop data = fetcher.fetch(index) File "/home/zuujhyt/miniconda3/envs/vlt5/lib/python3.7/site-packages/torch/utils/data/_utils/fetch.py", line...

When I use VLT5TokenizerFast to encode a sentence, a token with id 3 ('▁') appears before the id of token . For example, ```` from lib2to3.pgen2 import token from...

Thanks for your awesome work! I noticed you commented out the mode.zero_grad() call in https://github.com/j-min/VL-T5/blob/18699e2b1d5b4559f76c88a48cdec7176d356c34/VL-T5/src/caption.py#L225 So how does the model update its parameters?
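For context on the question above: in PyTorch, the parameter update is performed by the optimizer's `step()` call, while `zero_grad()` only clears gradients that would otherwise accumulate across `backward()` calls. A loop can also clear gradients another way, for example by setting each `param.grad = None`; whether the linked line does so is not visible in this snippet. The following is a minimal plain-Python sketch of that behaviour, not the VL-T5 training loop. `Param`, `backward`, `sgd_step`, and `train` are hypothetical names that stand in for autograd and an SGD optimizer.

```python
# Plain-Python sketch of gradient accumulation semantics (hypothetical names;
# this imitates PyTorch behaviour and is not code from the VL-T5 repository).

class Param:
    def __init__(self, value):
        self.value = value
        self.grad = None  # no gradient until backward() runs


def backward(param, target):
    """Accumulate the gradient of 0.5 * (w - target)**2 into param.grad."""
    g = param.value - target
    # Like autograd, add to the existing gradient instead of overwriting it.
    param.grad = g if param.grad is None else param.grad + g


def sgd_step(param, lr):
    """The parameter update happens here, regardless of how grads are cleared."""
    param.value -= lr * param.grad


def train(steps, clear_grads):
    w = Param(0.0)
    for _ in range(steps):
        backward(w, target=1.0)
        sgd_step(w, lr=0.1)
        if clear_grads:
            w.grad = None  # plays the role of zero_grad()
    return w.value


with_clear = train(3, clear_grads=True)      # gradients reset every step
without_clear = train(3, clear_grads=False)  # gradients pile up across steps
```

In both runs the parameter changes, because the update lives in `sgd_step`. Skipping the clearing step only makes each update use the sum of all earlier gradients, which is why training loops normally clear them by one means or another.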

Would you mind adding a RefCOCO inference example, especially for the Mask R-CNN model used for inference? Thanks.

![image](https://user-images.githubusercontent.com/16870890/156115914-5118cc49-b837-48e2-8c0b-4fc787d3bea4.png)

Hello! First of all, thank you so much for your work. I have read your paper and I want to carry out some open-ended VQA/answer generation VQA experiments with the...

![image](https://user-images.githubusercontent.com/16870890/155083477-921be349-f727-46f2-bc41-26ca0e5af5cb.png) ![image](https://user-images.githubusercontent.com/16870890/155083749-7648ce5e-9b00-485c-9740-ea3262641cb2.png)

Dear authors, Thanks for your great work! I fine-tuned a VL-T5 model on RefCOCOg, based on your released checkpoint, but got a bad result: `20.45407`. I noticed that in your...