FOTS_TF

models

Open DecentMakeover opened this issue 5 years ago • 16 comments

I have trained a model on the SynthText dataset. Detection is okay, but recognition is very bad.

https://drive.google.com/open?id=1SZaPveIjdhpkQgv6UL75VRi2c_kZIIhr

DecentMakeover avatar Apr 18 '19 07:04 DecentMakeover

@DecentMakeover Simple English letters and numbers?

zhoujianwen avatar Apr 23 '19 09:04 zhoujianwen

@zhoujianwen What do you mean?

DecentMakeover avatar Apr 23 '19 10:04 DecentMakeover

@DecentMakeover Can the trained model recognize simple numbers and English characters?

zhoujianwen avatar Apr 23 '19 11:04 zhoujianwen

I think some training tricks are needed that are not stated in the paper. I can't find any bugs in my code, but the results are still not ideal. :( I changed some details in recognition; you can update the code and try again.

Pay20Y avatar Apr 23 '19 13:04 Pay20Y

@Pay20Y

Thank you very much. I even changed the code to retrain the model on icdar2015. Not long into training, a warning was printed, but training did not stop and continued.

·····
RuntimeWarning:invalid value encountered in float_scalars
······
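
NumPy prints this kind of warning when a scalar float operation produces an undefined result, for example 0/0 when averaging over zero samples. The thread does not say where it arises in this repository, so the following is only a minimal sketch of one common guard; `safe_ratio` is a hypothetical helper, not a function from the repo.

```python
import math

def safe_ratio(numerator, denominator, default=0.0):
    """Return numerator / denominator, or `default` when the result is undefined.

    Hypothetical helper: guards the 0/0 and NaN cases that would otherwise
    propagate NaN into a loss or an accuracy metric.
    """
    if denominator == 0 or math.isnan(numerator) or math.isnan(denominator):
        return default
    return numerator / denominator

# Example: averaging a per-batch metric when the batch had no valid samples.
print(safe_ratio(0.0, 0.0))
print(safe_ratio(3.0, 2.0))
```

Whether falling back to a default is appropriate depends on where the NaN comes from; for a training loss, skipping the batch may be the better choice.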

When I first started, training took a long time but GPU memory was barely used, so I suspected that training was running on the CPU. I recommend using this command:

CUDA_VISIBLE_DEVICES=0 python2 multigpu_train.py --gpu_list=0 --training_data_path=/path/to/DataSet

Otherwise it's going to be slow. How much system memory and GPU memory is recommended?
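
To check which GPUs the `CUDA_VISIBLE_DEVICES` setting in the command above exposes to the process, the variable can be inspected from Python before training starts. This is a minimal sketch using only the standard library; `visible_gpu_ids` is a hypothetical helper, and it reports what the environment allows, not whether TensorFlow actually placed the graph on a GPU.

```python
import os

def visible_gpu_ids(environ=None):
    """Parse CUDA_VISIBLE_DEVICES into a list of device id strings.

    Returns None when the variable is unset (CUDA then exposes all GPUs)
    and an empty list when it is set but empty (no GPU is exposed).
    """
    environ = os.environ if environ is None else environ
    raw = environ.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        return None
    return [part.strip() for part in raw.split(",") if part.strip()]

# With the command above, the training process would see device "0" only.
print(visible_gpu_ids({"CUDA_VISIBLE_DEVICES": "0"}))
```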

zhoujianwen avatar Apr 23 '19 15:04 zhoujianwen

@zhoujianwen Not well; it sometimes predicts random letters.

DecentMakeover avatar Apr 23 '19 16:04 DecentMakeover

@DecentMakeover I was running the model when the machine hung again. Is 8 GB of memory not enough?

zhoujianwen avatar Apr 23 '19 16:04 zhoujianwen

Look for tf.app.flags.DEFINE_integer('batch_size_per_gpu', 8, '') in multigpu_train.py and change the value accordingly.
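
Since `--gpu_list=0` is passed on the command line in the training command earlier in this thread, `batch_size_per_gpu` can presumably be overridden the same way instead of editing the file. The sketch below uses the standard-library `argparse` as a stand-in for `tf.app.flags` to show the idea; only the flag name and its default of 8 come from the line quoted above.

```python
import argparse

def build_parser():
    """Stand-in parser: one integer flag with the same name and default as above."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--batch_size_per_gpu", type=int, default=8)
    return parser

# Passing a smaller value lowers memory use per training step.
args = build_parser().parse_args(["--batch_size_per_gpu=4"])
print(args.batch_size_per_gpu)
```

Halving the batch size until training fits in memory is a common first step when a machine hangs or runs out of memory.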

DecentMakeover avatar Apr 23 '19 16:04 DecentMakeover

@DecentMakeover Thank you very much. It's solved. I have set batch_size_per_gpu to 8 and now I can train well, but the memory usage is very high; maybe my machine's memory is too small.

zhoujianwen avatar Apr 23 '19 16:04 zhoujianwen

@Pay20Y The accuracy of word recognition is very bad. Why is that?

zhoujianwen avatar Apr 24 '19 13:04 zhoujianwen

Which test set do you use? If it is not icdar15, please set use_vocab to False in eval.py.
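
The thread does not show what `use_vocab` does inside eval.py. A common approach for ICDAR 2015 evaluation is to snap each predicted word to the closest entry in a fixed lexicon, which would explain why it hurts on other test sets whose words are not in that lexicon. The following is a minimal sketch of that idea under this assumption; `snap_to_vocab` and the example vocabulary are hypothetical and not taken from the repo.

```python
import difflib

def snap_to_vocab(prediction, vocab, cutoff=0.6):
    """Return the closest vocabulary word, or the raw prediction if none is close."""
    matches = difflib.get_close_matches(prediction, vocab, n=1, cutoff=cutoff)
    return matches[0] if matches else prediction

vocab = ["EXIT", "STOP", "HOTEL"]  # hypothetical lexicon

# A near miss is corrected to a lexicon word.
print(snap_to_vocab("H0TEL", vocab))
# A word far from every lexicon entry is returned unchanged here; a stricter
# implementation that always picks the nearest entry would corrupt it instead.
print(snap_to_vocab("PARKING", vocab))
```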

Pay20Y avatar Apr 25 '19 01:04 Pay20Y

@Pay20Y Could you help me? How can I train it on Arabic? CHAR.VECTOR contains letters, but Arabic is not like English: Arabic letters are joined to each other. Looking forward to your help.

AmrMaghraby avatar Jul 23 '19 13:07 AmrMaghraby

Sorry, I'm not familiar with Arabic. You can refer to here. It's a multi-language recognizer.
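
On the joined letters: although Arabic letters connect visually, Unicode text normally stores each letter as its own code point and the joined shapes are produced at render time, so a per-character vocabulary can still be built from the label strings. The sketch below assumes CHAR.VECTOR is simply a list of characters; `build_char_map`, `encode`, and `decode` are hypothetical helpers, not code from this repo, and any extra class the recognizer needs (such as a blank symbol) is left out.

```python
def build_char_map(labels):
    """Collect the distinct characters in the labels and assign each an integer id."""
    chars = sorted(set("".join(labels)))
    char_to_id = {ch: i for i, ch in enumerate(chars)}
    return chars, char_to_id

def encode(text, char_to_id):
    """Convert a label string into a list of character ids (logical order)."""
    return [char_to_id[ch] for ch in text]

def decode(ids, chars):
    """Convert a list of character ids back into a string."""
    return "".join(chars[i] for i in ids)

# Two Arabic words written with escapes: "salam" and "kitab".
labels = ["\u0633\u0644\u0627\u0645", "\u0643\u062a\u0627\u0628"]
chars, char_to_id = build_char_map(labels)
ids = encode(labels[0], char_to_id)
print(len(chars), ids)
```

Labels are stored in logical (reading) order, while the image shows the text right to left, so the label order may need to be matched to the direction in which the recognizer reads the feature sequence.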

Pay20Y avatar Jul 24 '19 01:07 Pay20Y

@Pay20Y

Could you help me evaluate the model? It shows me this error. I am not experienced with Makefiles.

image

AmrMaghraby avatar Jul 24 '19 09:07 AmrMaghraby

Sorry, I have no idea about it. I copied the NMS code from EAST. Maybe you can find the solution from the issue.

Pay20Y avatar Jul 25 '19 01:07 Pay20Y

@Pay20Y I know this is an old thread, but I came across your repository only recently. I'm trying to replicate results on SynthText data (English for now). I used E2E-MLT; that model performed decently on detection and very badly on recognition. I was hoping this repo does a better job on SynthText.

It looks like there have already been some changes that make training better. Before I run experiments, I wanted to get your thoughts on which parameters and tricks work best.

Appreciate any help!

mohammedayub44 avatar Apr 30 '21 19:04 mohammedayub44