attention-viz text to image view

text to image view

Open 18445864529 opened this issue 1 year ago • 1 comment

First, thank you for the great work!

I would like to know whether this tool can also produce text-to-image attention views for large vision-language models such as MiniGPT-4, LLaVA, and InstructBLIP.

Thanks!

Sep 18 '23 10:09 18445864529

Thanks so much for checking out AttentionViz! We have not tried visualizing text-to-image attention yet, but I think our tool/technique can feasibly be extended to vision-language models, and this is definitely a great direction for future work.

Sep 19 '23 22:09 catherinesyeh
