awesome-deep-text-detection-recognition

ASTER is not end-to-end in the normal sense

Open jdhao opened this issue 6 years ago • 2 comments

End-to-end recognition means that the whole image is fed into the network and the network outputs the recognition result for the whole image.

The input image to the network in ASTER is not the whole image, but a small crop containing the warped text. I think it is more accurate to call ASTER a recognition algorithm that can deal with irregular text images.

jdhao, Aug 06 '19 03:08

Thank you for pointing this out.

As you mentioned, ASTER is a recognition algorithm, which is why it is listed in the 'Text Recognition' part.

However, the ASTER paper also reports end-to-end performance for a two-stage approach: TextBoxes (detector) followed by ASTER (recognizer).

That's why I also added ASTER to the 'End-to-End Text Recognition' part. Please see TABLE 8 in the ASTER paper. (http://122.205.5.5:8071/UpLoadFiles/Papers/ASTER_PAMI18.pdf)

hwalsuklee, Aug 06 '19 07:08

But Table 8 cannot justify calling ASTER an end-to-end method. In that experiment ASTER is coupled with TextBoxes, and ASTER is only used to recognize the text inside the detected boxes.

An end-to-end method means that detection and recognition are performed by a single method, not by cascading two different methods from two different papers.
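
To make the distinction concrete, here is a minimal Python sketch of the two setups. Everything in it is a toy assumption for illustration: the names `two_stage_spotting`, `end_to_end_spotting`, `toy_detector`, `toy_recognizer`, and `toy_spotter` are hypothetical, the "image" is just a nested list of character codes, and none of it is the actual ASTER or TextBoxes code.

```python
from typing import Callable, List, Tuple

# Hypothetical toy types: a box is (x0, y0, x1, y1), an image is rows of ints.
Box = Tuple[int, int, int, int]
Image = List[List[int]]
Result = List[Tuple[Box, str]]


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of the full image."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def two_stage_spotting(
    image: Image,
    detector: Callable[[Image], List[Box]],
    recognizer: Callable[[Image], str],
) -> Result:
    """Cascade of two separate methods: the recognizer only ever sees crops."""
    return [(box, recognizer(crop(image, box))) for box in detector(image)]


def end_to_end_spotting(image: Image, spotter: Callable[[Image], Result]) -> Result:
    """A single method: the whole image goes in, boxes and text come out."""
    return spotter(image)


# Toy stand-ins so the sketch runs; real models would replace these.
def toy_detector(image: Image) -> List[Box]:
    return [(0, 0, 2, 1), (2, 1, 4, 2)]


def toy_recognizer(patch: Image) -> str:
    return "".join(chr(value) for row in patch for value in row)


def toy_spotter(image: Image) -> Result:
    # Stands in for one jointly trained network that does both tasks.
    return [((0, 0, 2, 1), "hi"), ((2, 1, 4, 2), "ok")]


image = [[ord(c) for c in "hi.."], [ord(c) for c in "..ok"]]
cascade_result = two_stage_spotting(image, toy_detector, toy_recognizer)
single_result = end_to_end_spotting(image, toy_spotter)
```

Both calls return the same kind of output, which is why a cascade can be scored on an end-to-end benchmark. The difference is structural: in the first function the recognizer is an independent component that receives only cropped regions, which is the role ASTER plays next to TextBoxes.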

jdhao, Aug 06 '19 13:08