pren icon indicating copy to clipboard operation
pren copied to clipboard

Failed to reproduce the results in the paper when training from the scratch

Open becxer opened this issue 2 years ago • 5 comments

Hello, we have a problem with reproducing the results in the paper.

With the official code and the default parameters for training, we are not able to reach the desirable scores except IC03 and IC13.

Method Train Opt Epoch IC03 IC13 IC15 IIIT5k SVT SVTP CUTE
PREN(Paper) - - 94.90 94.70 79.20 92.10 92.00 83.90 81.30
PREN(w/ Official code) default 3 95.23 94.52 76.97 84.33 87.33 79.23 71.18

We used all data in ST and MJ in LMDB format. We haven't changed any code except to import images and labels. By any chance, did you use preprocessing that does not exist in the current code when creating the image file?

And also it's very strange that the score on CUTE dataset is 10% lower than the reported one. Can you guide us in detail on how to reproduce it?

becxer avatar Jan 07 '22 16:01 becxer

Hi, this is strange, I have run the model with different random seed and can get similar results. The training data I use is not the LMDB format, I just download the original version, and for SynthText I clip text word images from the original image mannually. I will try to figure this out by also using the LMDB data recently.

RuijieJ avatar Jan 09 '22 08:01 RuijieJ

Thanks for the response!

I have an additional question about this. (1) When configuring batch, did you do random sampled without considering the ratio from the list of images from MJ data and ST data? (2) Can you guide me to the downloadable link to the train set that was exactly used before?

becxer avatar Jan 10 '22 04:01 becxer

(1) Yes, we simply sampled from the whole data, without considering the ratio from each dataset (2) for MJSynth, please reffer to the offical site, for SynthText, please also reffer to this link

RuijieJ avatar Jan 10 '22 12:01 RuijieJ

Hello, we have a problem with reproducing the results in the paper.

With the official code and the default parameters for training, we are not able to reach the desirable scores except IC03 and IC13.

Method Train Opt Epoch IC03 IC13 IC15 IIIT5k SVT SVTP CUTE PREN(Paper) - - 94.90 94.70 79.20 92.10 92.00 83.90 81.30 PREN(w/ Official code) default 3 95.23 94.52 76.97 84.33 87.33 79.23 71.18 We used all data in ST and MJ in LMDB format. We haven't changed any code except to import images and labels. By any chance, did you use preprocessing that does not exist in the current code when creating the image file?

And also it's very strange that the score on CUTE dataset is 10% lower than the reported one. Can you guide us in detail on how to reproduce it? Hello, have you found the problem? I also used the MJ and ST datasets in LMBD format for training, and got similar results to yours, but didn’t achieve the performance in the paper.

milely avatar Mar 23 '22 09:03 milely

怎么训练两个数据集

qyfff avatar Nov 09 '22 08:11 qyfff