deep-text-recognition-benchmark

how to recognize the blank between two English words

Open ghost opened this issue 5 years ago • 6 comments

How can I recognize the blank (space) between two English words? With my current model, if I input an image of an English sentence, the output concatenates all the English words together. For example: input image: 36, recognized result:

A------l-ll-t-h--e--r-e-c--o--g-n-i-tiio---n--a-c-cc--ur-a--c-i-e-s---o--n--t-h---e => Alltherecognitionaccuraciesonthe

So how can I recognize the blank between two English words?

Thank you so much.

ghost, Apr 26 '20 07:04

In the CRAFT project you need to set link_threshold and low_text higher so that it separates the words before you pass them as images to the OCR. CRAFT is responsible for cleanly separating the text; this project just reads the cropped images and has no notion of what a space is.

YacobBY, May 15 '20 10:05
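
For context, the raw string in the question looks like CTC output, where repeated characters are merged and a blank symbol is then dropped. The sketch below reproduces that collapse step, assuming `-` is the blank symbol as the example suggests; `ctc_collapse` is a hypothetical helper name, not a function from this repository. It illustrates why no spaces appear: the decoder can only emit characters that exist in the model's character set, so if that set has no space character, the collapsed text cannot contain one.

```python
from itertools import groupby


def ctc_collapse(raw: str, blank: str = "-") -> str:
    """Greedy CTC-style collapse: merge repeated symbols, then drop blanks.

    `blank` is assumed to be '-' because that is how the raw output
    in the question is displayed.
    """
    # Step 1: merge consecutive duplicates ("ll" -> "l", "---" -> "-").
    merged = (symbol for symbol, _ in groupby(raw))
    # Step 2: remove the blank symbol.
    return "".join(symbol for symbol in merged if symbol != blank)


raw = "A------l-ll-t-h--e--r-e-c--o--g-n-i-tiio---n--a-c-cc--ur-a--c-i-e-s---o--n--t-h---e"
print(ctc_collapse(raw))  # prints Alltherecognitionaccuraciesonthe
```

Note that the blank is what lets a genuine double letter survive: `l-ll` collapses to `ll`, whereas `lll` alone would collapse to a single `l`.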

@YacobBY thanks for the reply.

ghost, May 16 '20 20:05

@YacobBY Yes, the CRAFT project does exactly that. But afterward, how do we connect back the words that belong in the same sentence?

rexlow, Aug 11 '20 03:08

> @YacobBY Yes, the CRAFT project does exactly that. But afterward, how do we connect back the words that belong in the same sentence?

Probably by computing the proximity of the bounding boxes.

fingoldo, Oct 10 '20 06:10

@YacobBY

Hi, did you find a solution for this problem? I mean connecting back the words that belong in the same sentence. In my case it's numbers separated by ".".

Thanks in advance

AhmadZobairSurosh, Mar 02 '22 09:03

@AhmadZobairSurosh There is no real way to do it with deep learning. Probably just check whether the neighbouring word is at the same vertical pixel height, within a margin, and no more than a certain number of horizontal pixels away.

YacobBY, Mar 02 '22 09:03
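
The heuristic described above (same vertical position within a margin, limited horizontal gap) could be sketched roughly as follows. This is only an illustration: `WordBox`, `group_into_lines`, and the `y_margin` and `max_gap` parameters are hypothetical names, the boxes are assumed to be axis-aligned rectangles in pixel coordinates, and the thresholds are expressed as multiples of the box height, which is a design guess rather than anything from CRAFT or this repository.

```python
from dataclasses import dataclass


@dataclass
class WordBox:
    """One recognized word with its (assumed axis-aligned) bounding box."""
    text: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    @property
    def y_center(self) -> float:
        return (self.y_min + self.y_max) / 2

    @property
    def height(self) -> float:
        return self.y_max - self.y_min


def group_into_lines(boxes, y_margin=0.5, max_gap=2.0, separator=" "):
    """Join words that sit on the same row and are close horizontally.

    y_margin and max_gap are multiples of the previous word's height
    (hypothetical defaults; tune them for your images).
    """
    lines = []
    # Visit words left to right so each line grows at its right end.
    for box in sorted(boxes, key=lambda b: b.x_min):
        for line in lines:
            last = line[-1]
            same_row = abs(box.y_center - last.y_center) <= y_margin * last.height
            close_enough = box.x_min - last.x_max <= max_gap * last.height
            if same_row and close_enough:
                line.append(box)
                break
        else:
            # No existing line accepted the word, so start a new one.
            lines.append([box])
    # Order the lines top to bottom by their first word.
    lines.sort(key=lambda line: line[0].y_min)
    return [separator.join(b.text for b in line) for line in lines]


words = [
    WordBox("All", 10, 10, 40, 30),
    WordBox("the", 48, 11, 80, 31),
    WordBox("recognition", 88, 9, 200, 29),
    WordBox("accuracies", 10, 50, 110, 70),
]
print(group_into_lines(words))
```

For the case of numbers separated by ".", the same grouping could be called with `separator="."`, assuming the dots themselves are not detected as separate boxes.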