SynthText
SynthText copied to clipboard
The height of bounding boxes for some words is much larger than the words
This is causing some problems when determining the midpoints for the bounding boxes for the fully convolutional network. It's basically putting the midpoint in a different spot then it would have been had the bounding box tightly enclosed the text. Any ideas?
This happens for some fonts when they are "underlined" -- this increases the glyph height as the "underline" position is set at the lowest point, e.g., below the end of p
. This problem should go away if you don't use the underlined glyphs.
I am not using any underlining, at least as far I can tell. I set the underline probability to be 0.0.
On Thu, Mar 2, 2017 at 5:35 PM, Ankush Gupta [email protected] wrote:
This happens for some fonts when they are "underlined" -- this increases the glyph height as the "underline" position is set at the lowest point, e.g., below the end of p. This problem should go away if you don't use the underlined glyphs.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/ankush-me/SynthText/issues/25#issuecomment-283805059, or mute the thread https://github.com/notifications/unsubscribe-auth/ABL1YLCQIC7830JN3gpaYmNO1d4kDt9Zks5rh0RDgaJpZM4MRnEw .
Not sure then. Do you know if this is a problem with specific fonts?
I'm checking into it now.
Another thing I'm trying to do is use your FCRN concept to create a text detection framework (whereby I basically only use the "c" portion of your FCRN but determine if the cell contains text at all rather than the midpoint of a bounding box. I'm currently looking through the SyntheText python code to see if there's a good way I can get hold of the actual text pixels in a separate surface/img (to use as a mask) for labelling my "detection FCRN". Off hand, I'm seeing where the text is rendered to what looks like a blank pygame surface. Hopefully I can use this to extract the mask and label my 32x32 grid for my training set.
On Thu, Mar 2, 2017 at 6:06 PM, Ankush Gupta [email protected] wrote:
Not sure then. Do you know if this is a problem with specific fonts?
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/ankush-me/SynthText/issues/25#issuecomment-283811845, or mute the thread https://github.com/notifications/unsubscribe-auth/ABL1YNCxiBpP7n4VnJiLNyjPclb8gm_Aks5rh0tzgaJpZM4MRnEw .
Ah so this is actually a binary mask of the text drawn on the image? That's exactlyw hat I need is basically like a layer with only text (after it's been warped) so that I can determine which cells (32x32 grid, just like your FCRN implementation) actually contain text.
I got it! Looking @ the feathered mask. Perfect, Ankush. Thanks again!
@cjnolet @ankush-me
Facing the same bounding box height issues for few of the text boxes. Pretty mysterious. I'm trying to debug the whole thing to zero in on the issue. Meanwhile, the cause is already known to anyone?
@cjnolet I have a similar use case, please advise :
- elaborate on how to extract binary masks, can I generate them per character or per word.
- How to find absolute center point per bbox
- How to calculate width per char, can it be equal to width of char level bbox ?