yi-ming-qian
Results
2
issues of
yi-ming-qian
The issue is about the [text localization example](https://github.com/salesforce/LAVIS/blob/main/examples/blip_text_localization.ipynb). The input image is "../docs/_static/merlion.png" while the input caption is changed to "Merlion near marina bay. It is a city in Singapore....
Hello, thanks for sharing the work, it is very inspiring. I wonder if you can share the attention extraction and visualization script used for creating Figure 2 in the paper?