attention-ocr icon indicating copy to clipboard operation
attention-ocr copied to clipboard

Sliding window's width on image?

Open githubpiyush opened this issue 5 years ago • 3 comments

The gif's you have included as output, can we change the width of those sliding windows?

githubpiyush avatar Jul 22 '19 09:07 githubpiyush

You’d have to change the model for that, as the windows represent the attention part of the model. You could also probably adjust the drawing method, but then the representation wouldn’t be accurate anymore, so I’m not really sure why you’d want to do that. On Jul 22, 2019, 16:45 +0700, githubpiyush [email protected], wrote:

The gif's you have included as output, can we change the width of those sliding windows? — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

emedvedev avatar Jul 22 '19 09:07 emedvedev

How is the width is decided? Like which part of the code? Is it learning from previous input's attention or what? Thanks for the response

githubpiyush avatar Jul 22 '19 09:07 githubpiyush

The Drawing method is "visualize_attention" in model.py file?

I want to increase the width because if i can increase the width then characters in my input image can be recognized properly.

Edit: How to increase the attention mask size?

githubpiyush avatar Jul 22 '19 10:07 githubpiyush