robotframework-SikuliLibrary icon indicating copy to clipboard operation
robotframework-SikuliLibrary copied to clipboard

Is it possible "Get Text" in specific language using a different tesseract data?

Open romuluc opened this issue 6 years ago • 4 comments

Could I "Get Text" in portuguese ou spanish from OCR? Is there any way to calibrate OCR to improve the quality of obtained text?

romuluc avatar Jan 30 '19 12:01 romuluc

Yes, you can, take a look at this, you can add a traineddata data with your language and get better results, I'm from Brazil and I did this with portuguese, just look at the docs, don't forget to also configure the test cases to run with this configuration, I think there's a keyword for this.

👍

edsonharantes avatar Jan 31 '19 00:01 edsonharantes

Thank you @edsonharantes , @romuluc , "Get Text" is using tesseract, so need to add related trainedata.

rainmanwy avatar Feb 02 '19 07:02 rainmanwy

Hi @rainmanwy, I download chi_sim.traineddata and copy it to folder tessdata where eng.traineddata exists in SikuliLibrary.jar, but it still cann't identify Chinese when I use RobotFramework, Do I need to modify other files?

Thanks in advanced

YiQiSun-Uniques avatar Apr 12 '19 12:04 YiQiSun-Uniques

@YiQiSun-Uniques , currently i could not have a try ocr in my environment. Do you check the link as @edsonharantes suggested?

You may find sikulix folder in your home folder.

rainmanwy avatar Apr 18 '19 01:04 rainmanwy