EasyOCR icon indicating copy to clipboard operation
EasyOCR copied to clipboard

Any advice on how to improve performance on custom datasets?

Open Seoung-wook opened this issue 11 months ago • 1 comments

The final goal is to increase the recognition rate of Korean and English in Korean, English, Chinese, etc. In the case of the character area, I know that it is recognized in units of sentences. Which do you think will be a better dataset for training, word units or sentence units? Also, is it possible to learn with a Korean-applied dataset and apply it together with an English recognition model?

Seoung-wook avatar Aug 10 '23 04:08 Seoung-wook

As per documentation and sample dataset you need a word unit for training but I assume you can use sentences units but CRAFT will identify the words so there is no point using sentence.

yasaslive avatar Sep 25 '23 22:09 yasaslive