MaskTextSpotter icon indicating copy to clipboard operation
MaskTextSpotter copied to clipboard

how to get label of segmentations in SynthText

Open TriplePool opened this issue 5 years ago • 10 comments

Thank you for your work. Could you please release the code of pre-processing of SynthText.

TriplePool avatar Nov 09 '19 05:11 TriplePool

@TriplePool I will release the code of pre-processing of SynthText later. I hope the converted annotations are helpful to you currently.

MhLiao avatar Nov 12 '19 09:11 MhLiao

@MhLiao Thank you very much.

TriplePool avatar Nov 12 '19 09:11 TriplePool

@MhLiao Hi, I have downloaded the converted annotations, and the label of segmentations is a quadrangle that from "wordBB" in SynthText instead of polygon like TotalText. Is it necessary that pre-train the model with SynthText polygons label of word region, if I want to handle with curved text?

TriplePool avatar Nov 12 '19 11:11 TriplePool

@TriplePool In my opinion, more accurate annotations would be better. But I did not try that. If you do care about the performance of curved text, you can try it.

MhLiao avatar Nov 14 '19 03:11 MhLiao

@MhLiao the char annotation of icdar2013 was used in training?

ChChwang avatar Nov 21 '19 07:11 ChChwang

@TriplePool In my opinion, more accurate annotations would be better. But I did not try that. If you do care about the performance of curved text, you can try it.

Got it, thank you

TriplePool avatar Nov 23 '19 07:11 TriplePool

@MhLiao the char annotation of icdar2013 was used in training?

As far as I know, there is no char annotation in icdar2013 dataset, is there?

TriplePool avatar Nov 23 '19 07:11 TriplePool

@TriplePool @ChChwang The char annotation of icdar2013 was used in training.

MhLiao avatar Nov 23 '19 11:11 MhLiao

@MhLiao there is error in maskrcnn_benchmark/data/datasets/synthtext.py, what if "," in labels? see the label of city_112_47.jpg.

ChChwang avatar Nov 27 '19 10:11 ChChwang

Please help me. After I use other methods to predict the results, I get .txt such as res_img_1.txt, and each line is x1, y1, x2, y2, x3, y3, x4, y4, word. I don't know how to generate S, W, and G results. How to evaluate the end-to-end result? Please help me.

ustczhouyu avatar Nov 16 '20 00:11 ustczhouyu