EAST
EAST copied to clipboard
exuse me!!!!i have question,my data is image which full of long words,just like books,but when i train model by my data,it can not detect all the words in one line.maybe it can only detect half of a line and my input_size is 768 and text_scale is 768.could you tell me where i maybe wrong?thank you so much
exuse me!!!!i have question,my data is image which full of long words,but when i train model by my data,it can not detect all the words in one line.maybe it can only detect half of a line and my input_size is 768 and text_scale is 768.could you tell me where i maybe wrong?thank you so much @argman @zxytim
.............can you provide some sample images?
i am sorry , i ever thought issue can not show image.......@argman just like this image
currently EAST is more suitable for detecting words(or short text), if you want to detect long texts, you can resize the width side. If your images contain all like this, I will recommend CTPN
could you tell me why it can not detect long text? in fact, is long text regression hard to train? @argman
no, because the receptive field of CNN is too small
yes, i see, thank you so much!!!
@cjt222 I also face such a problem, do you have a solution?
@xiaomaxiao , @cjt222 : Did you guys manage to solve it? I am also stuck here. In case I resolve it before you guys revert then will update it here.
@ravikantb now , i use ctpn
@xiaomaxiao : Thanks for your response. Could you please tell me if you are using following implementation of CTPN: https://github.com/eragonruan/text-detection-ctpn or you have implemented your own?
@ravikantb yes , we can talk together 。
Sure, thanks! :)
CTPN works badly on oriented images. Does anyone make east to work on horizontal long text images?
HAHA, same issue. I am looking for the same solution as well @dajiangxiaoyan
Let me know if anyone has an alternative or solution for this. CTPN doesn't fit for my needs.
@argman - You mentioned that the receptive field of the CNN is too small. Does tweaking this CNN part be of any use? Did anyone try it?
@argman I have faced the same problem with long text too. But the main problem is the model performance is not consistent. I have done the test to understand the limits of the model on long text with varying fontsizes. Sometime it captures the most of the test and sometimes it misses the same whole line which it has already detected previously in another line.PFB the images for better understanding.
Also the resnet-50 has full image receptive field so how is it going to effect to detect long text?
If effecting, did anyone try modifying the receptive field to make the model work in long text?
thanks in advance
@renukachittimalla did you solve this issue?
@DonkeySmall what is this? how did you train that model?
This is EAST model trained on my own dataset, I don't remember the training parameters, and I don't have the dataset left.
Something like this

hey has anyone solved the problem?? :) I really need a solution for this problem
@argman @zxytim @renukachittimalla is there any solution yet? EAST cant detect long text lines?
What sort of aspect ratio of "long text" are you looking to detect?
The two main knobs for optimizing performance on long text is:
- resolution (essentially number of pixels each letter occupies)
- training data (if your training data represents the types of text aspect ratio you are expecting)
I'd suggest tuning the model using (1) and (2) above.
