Aggregation-Cross-Entropy icon indicating copy to clipboard operation
Aggregation-Cross-Entropy copied to clipboard

I would like to ask you how to accurately predict the character order of a word.

Open Meicsu199345 opened this issue 5 years ago • 6 comments

I recreated your project and found that the input GT was converted into a word list, which had lost its order, and your prediction only provided the number of characters. Only through the two-dimensional matrix position of the network output can barely judge the order, I would like to ask you how to accurately predict the character order of a word.

Meicsu199345 avatar Jun 25 '19 02:06 Meicsu199345

+1 upvote

lamhoangtung avatar Jul 02 '19 02:07 lamhoangtung

As mentioned in the paper, to decode the 2D prediction, we flattened the 2D prediction by concatenating each column in order from left to right and top to bottom and then decoded the flattened 1D prediction following the general procedure.

summerlvsong avatar Jul 04 '19 08:07 summerlvsong

@summerlvsong ,the target label don't need a fixed order?

chenjun2hao avatar Jul 25 '19 11:07 chenjun2hao

and i am confused about the 2D example, the label texts don't have a fixed order. if so, how to solve the 1D problem. waiting your reply.

chenjun2hao avatar Jul 25 '19 11:07 chenjun2hao

During training, we don't need a fixed order for supervision. When testing, for the 2D scene text recogntion problem, we use the hypothesis that character distribute form left to right in the 2D output. Therefore, we can decode the 2D prediction by flattening the 2D prediction by concatenating each column in order from left to right and top to bottom and then decoding the flattened 1D prediction following the general procedure.

summerlvsong avatar Aug 01 '19 01:08 summerlvsong

@summerlvsong Thanks for your reply.

yinghuozijin avatar Nov 19 '19 03:11 yinghuozijin