EAST icon indicating copy to clipboard operation
EAST copied to clipboard

exuse me!!!!i have question,my data is image which full of long words,just like books,but when i train model by my data,it can not detect all the words in one line.maybe it can only detect half of a line and my input_size is 768 and text_scale is 768.could you tell me where i maybe wrong?thank you so much

Open cjt222 opened this issue 8 years ago • 25 comments

exuse me!!!!i have question,my data is image which full of long words,but when i train model by my data,it can not detect all the words in one line.maybe it can only detect half of a line and my input_size is 768 and text_scale is 768.could you tell me where i maybe wrong?thank you so much @argman @zxytim

cjt222 avatar Nov 06 '17 01:11 cjt222

.............can you provide some sample images?

argman avatar Nov 06 '17 01:11 argman

i am sorry , i ever thought issue can not show image.......@argman just like this image

cjt222 avatar Nov 06 '17 02:11 cjt222

currently EAST is more suitable for detecting words(or short text), if you want to detect long texts, you can resize the width side. If your images contain all like this, I will recommend CTPN

argman avatar Nov 06 '17 02:11 argman

could you tell me why it can not detect long text? in fact, is long text regression hard to train? @argman

cjt222 avatar Nov 06 '17 02:11 cjt222

no, because the receptive field of CNN is too small

argman avatar Nov 06 '17 02:11 argman

yes, i see, thank you so much!!!

cjt222 avatar Nov 06 '17 03:11 cjt222

@cjt222 I also face such a problem, do you have a solution?

xiaomaxiao avatar Nov 13 '17 09:11 xiaomaxiao

@xiaomaxiao , @cjt222 : Did you guys manage to solve it? I am also stuck here. In case I resolve it before you guys revert then will update it here.

ravikantb avatar Feb 24 '18 11:02 ravikantb

@ravikantb now , i use ctpn

xiaomaxiao avatar Feb 24 '18 12:02 xiaomaxiao

@xiaomaxiao : Thanks for your response. Could you please tell me if you are using following implementation of CTPN: https://github.com/eragonruan/text-detection-ctpn or you have implemented your own?

ravikantb avatar Feb 24 '18 12:02 ravikantb

@ravikantb yes , we can talk together 。

xiaomaxiao avatar Feb 24 '18 12:02 xiaomaxiao

Sure, thanks! :)

ravikantb avatar Feb 24 '18 12:02 ravikantb

CTPN works badly on oriented images. Does anyone make east to work on horizontal long text images?

dajiangxiaoyan avatar Mar 29 '18 05:03 dajiangxiaoyan

HAHA, same issue. I am looking for the same solution as well @dajiangxiaoyan

gangliu123 avatar Mar 30 '18 09:03 gangliu123

Let me know if anyone has an alternative or solution for this. CTPN doesn't fit for my needs.

codecolony avatar May 18 '18 02:05 codecolony

@argman - You mentioned that the receptive field of the CNN is too small. Does tweaking this CNN part be of any use? Did anyone try it?

codecolony avatar May 18 '18 06:05 codecolony

@argman I have faced the same problem with long text too. But the main problem is the model performance is not consistent. I have done the test to understand the limits of the model on long text with varying fontsizes. Sometime it captures the most of the test and sometimes it misses the same whole line which it has already detected previously in another line.PFB the images for better understanding. img_22 img_4 Also the resnet-50 has full image receptive field so how is it going to effect to detect long text? If effecting, did anyone try modifying the receptive field to make the model work in long text?

thanks in advance

renukachittimalla avatar Jan 09 '19 15:01 renukachittimalla

@renukachittimalla did you solve this issue?

wiistriker avatar Jul 25 '20 06:07 wiistriker

@DonkeySmall what is this? how did you train that model?

wiistriker avatar Jul 25 '20 19:07 wiistriker

This is EAST model trained on my own dataset, I don't remember the training parameters, and I don't have the dataset left.

ghost avatar Jul 26 '20 07:07 ghost

Something like this

image_591311

ghost avatar Jul 26 '20 07:07 ghost

hey has anyone solved the problem?? :) I really need a solution for this problem

daniiki avatar Oct 21 '20 10:10 daniiki

@argman @zxytim @renukachittimalla is there any solution yet? EAST cant detect long text lines?

wiistriker avatar Oct 23 '20 07:10 wiistriker

What sort of aspect ratio of "long text" are you looking to detect?

The two main knobs for optimizing performance on long text is:

  1. resolution (essentially number of pixels each letter occupies)
  2. training data (if your training data represents the types of text aspect ratio you are expecting)

I'd suggest tuning the model using (1) and (2) above.

khu834 avatar Nov 18 '20 23:11 khu834