attention-viz icon indicating copy to clipboard operation
attention-viz copied to clipboard

text to image view

Open 18445864529 opened this issue 1 year ago • 1 comments

First thank you for the great work!

I would like to know whether this tool can also do text-to-image attention views for large vision-language models such as MiniGPT-4, LLaVA, InstructBLIP, etc.?

Thanks!

18445864529 avatar Sep 18 '23 10:09 18445864529

Thanks so much for checking out AttentionViz! We have not tried visualizing text-to-image attention yet but I think our tool/technique can feasibly be extended to vision-language models and this is definitely a great direction for future work.

catherinesyeh avatar Sep 19 '23 22:09 catherinesyeh