Language detection
Hello everyone,
Q1) Is it possible to detect the language of the text before passing the image to EasyOCR? Q2) Does EasyOCR have any feature for automatic language detection?
Or can I use another model for language recognition?
What you could do is run EasyOCR or any other OCR engine over the image first to extract the text, and then pass that text through langdetect to identify the language.
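A minimal sketch of that two-step idea, assuming the `easyocr` and `langdetect` packages are installed. The function names and the `first_pass_langs` parameter are made up for illustration, and the first pass still has to be given some language list, so the result is best treated as a hint rather than a reliable answer:

```python
def join_ocr_text(fragments):
    """Join OCR text fragments into one string, skipping empty pieces."""
    return " ".join(f.strip() for f in fragments if f.strip())


def detect_image_language(image_path, first_pass_langs=("en",)):
    """Run a rough OCR pass, then guess the language of the extracted text.

    Returns a language code such as "en", or None if no guess is possible.
    `first_pass_langs` is an assumption: a starting guess for the first pass.
    """
    # Imported here so join_ocr_text stays usable without these packages.
    import easyocr
    from langdetect import DetectorFactory, detect
    from langdetect.lang_detect_exception import LangDetectException

    DetectorFactory.seed = 0  # make langdetect results repeatable
    reader = easyocr.Reader(list(first_pass_langs))
    # detail=0 returns only the recognized strings, without boxes or scores.
    text = join_ocr_text(reader.readtext(image_path, detail=0))
    try:
        return detect(text)
    except LangDetectException:
        # Raised when the text has nothing langdetect can work with.
        return None
```

If the first pass uses a language list that does not cover the script in the image, the extracted text may be garbage and the guess along with it, so it can be worth trying a few different first-pass lists.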
Almost all OCR libraries, if not all, need to know the language used in the image you're analyzing for the recognizer to work correctly. I've created a project that should resolve that problem in the coming weeks. If you're interested, just follow me.
We faced a similar issue at work. We ran a first-level OCR pass with a standard, basic library that supports multiple languages, then used a voting mechanism over several methods, such as the Unicode code points of the characters (to identify the language/script), langdetect, etc. Based on the top-2 languages by majority vote, we then ran EasyOCR or PaddleOCR, depending on which one worked better for each language. Hope this helps anyone reading this now.
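A rough sketch of what such a voting step could look like, using only the Python standard library. This is not the poster's actual code: the function names and the `SCRIPT_TO_LANGS` table are hypothetical, and a script only narrows the candidates down (Latin script alone does not tell English from French), which is why other detectors such as langdetect would vote alongside it:

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from script to candidate languages; extend as needed.
SCRIPT_TO_LANGS = {
    "LATIN": ["en"],
    "CYRILLIC": ["ru"],
    "ARABIC": ["ar"],
    "DEVANAGARI": ["hi"],
}


def dominant_script(text):
    """Return the most common script among the letters in text, or None.

    The script is taken from the first word of each character's Unicode
    name, e.g. "CYRILLIC CAPITAL LETTER PE" -> "CYRILLIC".
    """
    counts = Counter()
    for ch in text:
        if ch.isalpha():
            name = unicodedata.name(ch, "")
            if name:
                counts[name.split()[0]] += 1
    return counts.most_common(1)[0][0] if counts else None


def script_vote(text):
    """Detector that votes for the languages mapped to the dominant script."""
    return SCRIPT_TO_LANGS.get(dominant_script(text), [])


def top_two_languages(text, detectors):
    """Collect votes from every detector and return up to two top languages.

    Each detector is a callable taking the text and returning a list of
    language codes (possibly empty).
    """
    votes = Counter()
    for detector in detectors:
        for lang in detector(text):
            votes[lang] += 1
    return [lang for lang, _ in votes.most_common(2)]
```

A langdetect-based detector could be added to the `detectors` list as another callable, and the two winning languages could then be passed to whichever OCR engine handles them best.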