Training model on Japanese dataset

Open Preethse opened this issue 2 years ago • 4 comments

I created my own dataset of 14M images and started training on it. Strangely, the results look like this after the first epoch:

[Screenshot: Screen Shot 2023-01-26 at 15 30 09]

I would really appreciate any feedback on how to improve my training.

Preethse avatar Jan 26 '23 06:01 Preethse

Hi, can this model be used to identify languages, such as Chinese, Japanese, etc.?

Chenxx017 avatar Feb 03 '23 06:02 Chenxx017

Check out this repo: https://github.com/bharatsubedi/PARseq_torch. You just need to update the character set to cover the language you need, and you will be able to train the network.
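For illustration, here is a minimal sketch of deriving a character set from your own labels before training. The helper names `build_charset` and `uncovered_labels` are hypothetical and are not part of either repo; how the resulting string gets passed to the training config depends on the repo and is not shown here.

```python
def build_charset(labels):
    """Return every distinct character in the labels, sorted by code point."""
    return "".join(sorted(set("".join(labels))))


def uncovered_labels(labels, charset):
    """Return the labels containing at least one character missing from charset."""
    allowed = set(charset)
    return [text for text in labels if not set(text) <= allowed]


if __name__ == "__main__":
    # Toy labels; in practice these would come from your dataset annotations.
    train_labels = ["東京", "京都", "大阪"]
    charset = build_charset(train_labels)
    print(len(charset), "distinct characters")

    # Validation labels that the charset cannot represent are worth catching
    # early, since the model has no output class for the missing characters.
    print(uncovered_labels(["東京", "名古屋"], charset))
```

Checking the validation labels against the training character set like this is a cheap way to spot coverage gaps before a long run.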

lerndeep avatar Mar 27 '23 05:03 lerndeep

@Preethse try early stopping. Your training is collapsing after a while. You could also try tuning the learning rate. The training dynamics change drastically when you change the training data, so the default hyperparameters might not work well for you.

Also, for experimentation, use a bigger model, like the base Transformer configuration (d=768).
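As a rough, framework-agnostic sketch of early stopping: track the validation loss each epoch and stop once it has failed to improve for a set number of epochs. The class `EarlyStopping`, the helper `epochs_run`, and the `patience` and `min_delta` values are illustrative assumptions, not this project's code or defaults.

```python
class EarlyStopping:
    """Signal a stop when validation loss has not improved for `patience` epochs."""

    def __init__(self, patience=3, min_delta=0.0):
        self.patience = patience      # epochs to wait without improvement
        self.min_delta = min_delta    # minimum decrease that counts as improvement
        self.best = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's validation loss; return True if training should stop."""
        if val_loss < self.best - self.min_delta:
            self.best = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


def epochs_run(val_losses, patience=3):
    """Return how many epochs would run on this loss sequence before stopping."""
    stopper = EarlyStopping(patience=patience)
    for epoch, loss in enumerate(val_losses, start=1):
        if stopper.step(loss):
            return epoch
    return len(val_losses)


if __name__ == "__main__":
    # Made-up losses: improvement for three epochs, then a collapse.
    print(epochs_run([1.0, 0.8, 0.7, 0.9, 1.2, 1.5, 2.0], patience=3))
```

In a real run you would also save a checkpoint whenever the best loss improves, so that the weights from before the collapse can be restored.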

baudm avatar Mar 28 '23 04:03 baudm

@Chenxx017 Sorry, but no. That's outside the scope of this project.

baudm avatar Mar 28 '23 04:03 baudm