whqwill

Results 2 issues of whqwill

I use https://github.com/peteanderson80/bottom-up-attention/ to extract features from my own images and then run the image captioning model, but the resulting caption is incomplete, e.g. ![image](https://user-images.githubusercontent.com/7381876/112450451-61779a00-8d8f-11eb-9c0f-1fe6e1996ee0.png) caption: "a view of a...

Is there a paper or algorithm description covering the text extraction? I would like to understand the theory behind it. Thanks.