
Language detection

Open omvishwas opened this issue 2 years ago • 3 comments

Hello Everyone ,

Q1) Is it possible to detect the language of the text before we pass the image to EasyOCR? Q2) Does EasyOCR have any feature for automatic text detection?

Or can I use another model for language recognition?

omvishwas avatar Jul 11 '23 05:07 omvishwas

Well, what you could do is run EasyOCR or any other OCR engine first to extract the text, then pass that text through langdetect to identify the language.
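A minimal sketch of that two-step idea, assuming the `easyocr` and `langdetect` packages are installed. The function name `detect_text_language` and its parameters are hypothetical; `read_text` and `detect_lang` are injectable hooks added here only so the pipeline can be tried without loading the OCR models. Note that the first OCR pass still needs some initial language list, so this only works when the text is at least roughly readable with that list.

```python
def detect_text_language(image_path, ocr_langs=("en",), read_text=None, detect_lang=None):
    """OCR an image, then guess the language of the extracted text.

    read_text and detect_lang are hypothetical hooks; when omitted, the
    function falls back to EasyOCR and langdetect.
    """
    if read_text is None:
        import easyocr  # imported lazily: model loading is slow
        reader = easyocr.Reader(list(ocr_langs))
        # detail=0 makes readtext return plain strings instead of boxes
        read_text = lambda path: reader.readtext(path, detail=0)
    if detect_lang is None:
        from langdetect import detect as detect_lang

    # Join the recognized fragments into one string for the detector
    text = " ".join(read_text(image_path)).strip()
    if not text:
        return None  # nothing recognized, so no language guess
    return detect_lang(text)
```

Called as `detect_text_language("scan.png")`, this would return a language code such as `"en"`; the file name is only a placeholder.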

nash-ajit avatar Jul 17 '23 19:07 nash-ajit

Because of how most, if not all, OCR libraries work, you must know the language used in the image you're analyzing for the recognizer to work correctly. I've created a project that will resolve that problem in the coming weeks. If you're interested, just follow me.

Th3Tr1ckst3r avatar Aug 22 '23 02:08 Th3Tr1ckst3r

We faced a similar issue at work. We ran a first-pass OCR with a standard library that supports multiple languages, then used a voting mechanism over several methods, such as checking the Unicode of the characters to identify the language or script, langdetect, etc. Based on the top two languages by majority vote, we then ran EasyOCR or PaddleOCR, depending on which one worked better for each language. Hope this helps anyone reading this now.
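A rough sketch of the script check and the vote, using only the Python standard library. This is not the exact pipeline described above: the helper names `script_of`, `dominant_scripts`, and `vote` are hypothetical, and taking the first word of a character's Unicode name is just one simple approximation of its script.

```python
import unicodedata
from collections import Counter


def script_of(ch):
    """Approximate a character's script by the first word of its Unicode name."""
    if not ch.isalpha():
        return None  # skip digits, punctuation, spaces, combining marks
    try:
        return unicodedata.name(ch).split()[0]  # e.g. "LATIN", "DEVANAGARI"
    except ValueError:
        return None  # character has no Unicode name


def dominant_scripts(text, top_n=2):
    """Return the top_n most frequent scripts among the letters in text."""
    counts = Counter(s for s in map(script_of, text) if s)
    return [script for script, _ in counts.most_common(top_n)]


def vote(candidates, top_n=2):
    """Majority vote over language guesses from several detectors.

    None entries (detectors that gave no answer) are ignored.
    """
    counts = Counter(c for c in candidates if c)
    return [lang for lang, _ in counts.most_common(top_n)]
```

For example, `vote(["hi", "en", "hi", "en", "hi", "mr"])` gives `["hi", "en"]`, and those two codes could then be passed to whichever OCR engine handles them best.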

brownsloth avatar Jun 21 '25 12:06 brownsloth