LAVIS - A One-stop Library for Language-Vision Intelligence
Hi, I used BlipForConditionalGeneration from transformers for image captioning. I want to visualize the reason for each word of the generated caption, like Grad-CAM. I found code from ALBEF (https://github.com/salesforce/ALBEF/blob/main/visualization.ipynb),...
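As context for the question above, the core Grad-CAM step is simple once the activations and their gradients have been captured (e.g. via hooks on a cross-attention layer, as in the linked ALBEF notebook). A minimal sketch of just that weighting step, with toy arrays standing in for real activations/gradients:

```python
import numpy as np

def grad_cam(activations, gradients):
    """Grad-CAM heatmap from one layer's activations and their gradients.

    activations, gradients: arrays of shape (channels, H, W).
    """
    # alpha_k: global-average-pool the gradients per channel
    weights = gradients.mean(axis=(1, 2))
    # weighted sum of activation maps over channels
    cam = np.einsum("k,khw->hw", weights, activations)
    # ReLU keeps only features with positive influence on the target word
    cam = np.maximum(cam, 0)
    # normalize to [0, 1] so the map can be overlaid on the image
    if cam.max() > 0:
        cam = cam / cam.max()
    return cam

# toy example: 8 channels, 7x7 spatial map
acts = np.random.rand(8, 7, 7)
grads = np.random.rand(8, 7, 7)
heatmap = grad_cam(acts, grads)
print(heatmap.shape)  # (7, 7)
```

For word-by-word visualization you would run this once per generated token, using that token's score as the backward target; how to obtain the gradients for a BLIP cross-attention layer is model-specific and not shown here.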
Thank you so much for the code! It is very useful. Could you please also open-source the retrieval training based on BLIP-2? Any help is greatly appreciated.
Hello, I appreciate the work you've done. I would like to ask how to interpret the image-text retrieval score. I received a score like this:...
In my understanding, VQA is similar to the zero-shot image-to-text generation ability mentioned in the BLIP-2 paper. Both produce an answer conditioned on a prompt (a question or natural-language instructions)...
Thank you very much for your open-source contribution; the model's performance is amazing. If I want to obtain image features from the intermediate layers of the backbone,...
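For the question above, the usual PyTorch approach is a forward hook on the layer of interest. A minimal sketch with a toy backbone standing in for the real vision model (the module path you would actually hook, e.g. a specific encoder layer, depends on the model and is an assumption here):

```python
import torch
import torch.nn as nn

# Toy stand-in for a vision backbone; with a real model you would hook the
# desired submodule (its name/path is model-specific).
backbone = nn.Sequential(
    nn.Conv2d(3, 8, 3, padding=1),
    nn.ReLU(),
    nn.Conv2d(8, 16, 3, padding=1),
)

features = {}

def save_output(name):
    # forward hook: stash the layer's output under a readable key
    def hook(module, inputs, output):
        features[name] = output.detach()
    return hook

handle = backbone[0].register_forward_hook(save_output("layer0"))

x = torch.randn(1, 3, 32, 32)
_ = backbone(x)       # hook fires during the forward pass
handle.remove()       # clean up so the hook doesn't fire on later calls

print(features["layer0"].shape)  # torch.Size([1, 8, 32, 32])
```

The same pattern works for any `nn.Module`, including transformer encoder layers; keep a reference to the handle and remove it when done to avoid leaking hooks.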
Hi, thanks for open-sourcing this great work. Do you have examples for pre-training BLIP-2 on my own data?
Do you have a training config for blip2 vicuna instruct? Currently, using a VQA dataset with the "blip_question" text processor and a VQA task, I encounter an error at this line...
Are these models supported on an NVIDIA `Quadro RTX 5000`?