Long-width images give bad results, short ones do not

Open Jayhello opened this issue 7 years ago • 6 comments

For example, the good result with a short-width image: [image]

The bad result with a long-width image: [image]

Jayhello avatar Feb 15 '17 03:02 Jayhello

This is because the best path decoding method does not work well on long sequences. Please switch to another CTC decoding method.
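For reference, best path (greedy) CTC decoding can be sketched as below. This is a minimal illustration, not the code from this repository: the function name `best_path_decode`, the blank index `0`, and the toy probabilities are assumptions made for the example.

```python
def best_path_decode(probs, blank=0):
    """Greedy CTC decoding.

    probs: list of per-timestep class probabilities (T rows, C columns).
    blank: index of the CTC blank class (assumed to be 0 here).
    """
    # Pick the single most likely class at every timestep.
    path = [max(range(len(step)), key=step.__getitem__) for step in probs]

    decoded = []
    prev = None
    for k in path:
        # Collapse consecutive repeats, then drop blanks.
        if k != prev and k != blank:
            decoded.append(k)
        prev = k
    return decoded


# Toy input: 5 timesteps, 3 classes (0 = blank).
probs = [
    [0.1, 0.8, 0.1],    # class 1
    [0.2, 0.7, 0.1],    # class 1 again (repeat, collapsed)
    [0.9, 0.05, 0.05],  # blank
    [0.1, 0.6, 0.3],    # class 1 (kept, a blank separates it from the last 1)
    [0.1, 0.2, 0.7],    # class 2
]
print(best_path_decode(probs))  # [1, 1, 2]
```

Because only the single most likely path is considered, probability mass spread over many different alignments of the same label is ignored.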

Duum avatar Feb 15 '17 05:02 Duum

@Duum Thank you very much for your reply. But how do I do this in detail? And can you give some advice about the question in https://github.com/bgshih/crnn/issues/39

Jayhello avatar Feb 15 '17 08:02 Jayhello

You can try prefix search decoding or another decoding method.
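As a rough sketch of one such alternative, the code below implements CTC prefix beam search, a commonly used beam-limited relative of prefix search decoding (it is not the exact prefix search algorithm, and it is not taken from this repository). The name `prefix_beam_search`, the blank index `0`, and the `beam_width` default are assumptions for illustration.

```python
from collections import defaultdict


def prefix_beam_search(probs, blank=0, beam_width=10):
    """CTC prefix beam search without a language model.

    probs: list of per-timestep class probabilities (T rows, C columns).
    Returns the most probable label sequence as a list of class indices.
    """
    # For each prefix keep (p_blank, p_non_blank): the total probability of
    # all paths that collapse to the prefix and end in blank / non-blank.
    beams = {(): (1.0, 0.0)}

    for step in probs:
        next_beams = defaultdict(lambda: [0.0, 0.0])
        for prefix, (p_b, p_nb) in beams.items():
            for k, p in enumerate(step):
                if k == blank:
                    # A blank keeps the prefix unchanged.
                    next_beams[prefix][0] += (p_b + p_nb) * p
                elif prefix and prefix[-1] == k:
                    # Repeating the last label without a blank in between
                    # collapses into the same prefix.
                    next_beams[prefix][1] += p_nb * p
                    # It only starts a new label if the path ended in blank.
                    next_beams[prefix + (k,)][1] += p_b * p
                else:
                    # A different label always extends the prefix.
                    next_beams[prefix + (k,)][1] += (p_b + p_nb) * p

        # Keep only the most probable prefixes.
        ranked = sorted(next_beams.items(), key=lambda kv: sum(kv[1]), reverse=True)
        beams = {prefix: (pb, pnb) for prefix, (pb, pnb) in ranked[:beam_width]}

    best_prefix, _ = max(beams.items(), key=lambda kv: sum(kv[1]))
    return list(best_prefix)


# Two timesteps, classes: 0 = blank, 1 = "a".
# The single best path is blank, blank (0.36), but the paths that collapse
# to "a" sum to 0.24 + 0.24 + 0.16 = 0.64.
toy = [[0.6, 0.4], [0.6, 0.4]]
print(prefix_beam_search(toy))  # [1]
```

On this toy input a greedy decoder would output the empty sequence, while summing over alignments recovers the label `1`.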

Duum avatar Feb 22 '17 09:02 Duum

@Duum Why does the length of the label have a negative effect on the precision of best path decoding? Is there any literature discussing this?

misssprite avatar May 17 '17 11:05 misssprite

I think the problem is caused by rescaling the image to 100x32. For long-sequence images, this rescaling makes each character appear heavily squeezed, which can hurt classification. I don't think best path decoding can influence the results in such a way.

hellbago avatar Jun 27 '17 12:06 hellbago

@hellbago CRNN is able to handle images of arbitrary width. So, for testing, I think you only have to resize the image to the target height while preserving the aspect ratio; otherwise you will get a distorted image, as you said.
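The resize described above can be sketched as a small helper that computes the target size. The function name `aspect_preserving_size` is hypothetical, and the default height of 32 is taken from the 100x32 size mentioned earlier in this thread.

```python
def aspect_preserving_size(width, height, target_height=32):
    """Return (new_width, new_height) that fixes the height and keeps the aspect ratio."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    scale = target_height / height
    # Scale the width by the same factor as the height; never go below 1 pixel.
    new_width = max(1, round(width * scale))
    return new_width, target_height


# A 400x64 text line becomes 200x32 instead of being squeezed to 100x32.
print(aspect_preserving_size(400, 64))  # (200, 32)
```

The returned size can then be passed to whatever image library is in use, for example `img.resize((new_w, new_h))` with Pillow, assuming `img` is a `PIL.Image.Image`.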

Heisenberg0391 avatar Jan 26 '19 02:01 Heisenberg0391