EAST exuse me！！！！i have question,my data is image which full of long words,just like books,but when i train model by my data,it can not detect all the words in one line.maybe it can only detect half of a line and my input_size is 768 and text_scale is 768.could you tell me where i maybe wrong?thank you so much

exuse me！！！！i have question,my data is image which full of long words,just like books,but when i train model by my data,it can not detect all the words in one line.maybe it can only detect half of a line and my input_size is 768 and text_scale is 768.could you tell me where i maybe wrong?thank you so much

Open cjt222 opened this issue 8 years ago • 25 comments

exuse me！！！！i have question,my data is image which full of long words,but when i train model by my data,it can not detect all the words in one line.maybe it can only detect half of a line and my input_size is 768 and text_scale is 768.could you tell me where i maybe wrong?thank you so much @argman @zxytim

Nov 06 '17 01:11 cjt222

.............can you provide some sample images?

Nov 06 '17 01:11 argman

i am sorry , i ever thought issue can not show image.......@argman just like this image

Nov 06 '17 02:11 cjt222

currently EAST is more suitable for detecting words(or short text), if you want to detect long texts, you can resize the width side. If your images contain all like this, I will recommend CTPN

Nov 06 '17 02:11 argman

could you tell me why it can not detect long text? in fact, is long text regression hard to train? @argman

Nov 06 '17 02:11 cjt222

no, because the receptive field of CNN is too small

Nov 06 '17 02:11 argman

yes, i see, thank you so much!!!

Nov 06 '17 03:11 cjt222

@cjt222 I also face such a problem, do you have a solution?

Nov 13 '17 09:11 xiaomaxiao

@xiaomaxiao , @cjt222 : Did you guys manage to solve it? I am also stuck here. In case I resolve it before you guys revert then will update it here.

Feb 24 '18 11:02 ravikantb

@ravikantb now , i use ctpn

Feb 24 '18 12:02 xiaomaxiao

@xiaomaxiao : Thanks for your response. Could you please tell me if you are using following implementation of CTPN: https://github.com/eragonruan/text-detection-ctpn or you have implemented your own?

Feb 24 '18 12:02 ravikantb

@ravikantb yes , we can talk together 。

Feb 24 '18 12:02 xiaomaxiao

Sure, thanks! :)

Feb 24 '18 12:02 ravikantb

CTPN works badly on oriented images. Does anyone make east to work on horizontal long text images?

Mar 29 '18 05:03 dajiangxiaoyan

HAHA， same issue. I am looking for the same solution as well @dajiangxiaoyan

Mar 30 '18 09:03 gangliu123

Let me know if anyone has an alternative or solution for this. CTPN doesn't fit for my needs.

May 18 '18 02:05 codecolony

@argman - You mentioned that the receptive field of the CNN is too small. Does tweaking this CNN part be of any use? Did anyone try it?

May 18 '18 06:05 codecolony

@argman I have faced the same problem with long text too. But the main problem is the model performance is not consistent. I have done the test to understand the limits of the model on long text with varying fontsizes. Sometime it captures the most of the test and sometimes it misses the same whole line which it has already detected previously in another line.PFB the images for better understanding. Also the resnet-50 has full image receptive field so how is it going to effect to detect long text? If effecting, did anyone try modifying the receptive field to make the model work in long text?

thanks in advance

Jan 09 '19 15:01 renukachittimalla

@renukachittimalla did you solve this issue?

Jul 25 '20 06:07 wiistriker

>>>frozen_model.pb<<<

test1 test2 test3

Jul 25 '20 07:07 ghost

@DonkeySmall what is this? how did you train that model?

Jul 25 '20 19:07 wiistriker

This is EAST model trained on my own dataset, I don't remember the training parameters, and I don't have the dataset left.

Jul 26 '20 07:07 ghost

Something like this

Jul 26 '20 07:07 ghost

hey has anyone solved the problem?? :) I really need a solution for this problem

Oct 21 '20 10:10 daniiki

@argman @zxytim @renukachittimalla is there any solution yet? EAST cant detect long text lines?

Oct 23 '20 07:10 wiistriker

What sort of aspect ratio of "long text" are you looking to detect?

The two main knobs for optimizing performance on long text is:

resolution (essentially number of pixels each letter occupies)
training data (if your training data represents the types of text aspect ratio you are expecting)

I'd suggest tuning the model using (1) and (2) above.

Nov 18 '20 23:11 khu834

EAST EAST copied to clipboard

EAST
EAST copied to clipboard