android-ocr icon indicating copy to clipboard operation
android-ocr copied to clipboard

Add preserve_interword_spaces=1 argument

Open languagemaniac opened this issue 2 years ago • 3 comments

Hi, when I OCR Japanese text, it comes out with random spaces.

This can be easily fixed by adding "-c preserve_interword_spaces=1" as an argument when executing tesseract.

I tried it on my PC with the same result. Adding that argument fixes the issue.

For what I've been reading, it's the same for Chinese and Korean, (though I haven't tried with those) so maybe there should be an option to enable / disable that specific argument, as these languages don't have any interword spaces whatsoever.

languagemaniac avatar Oct 23 '23 19:10 languagemaniac

  • I will try to add this in my app

T8RIN avatar Apr 08 '24 20:04 T8RIN

  • I will try to add this in my app

Which app?

languagemaniac avatar Apr 08 '24 20:04 languagemaniac

ImageToolbox

T8RIN avatar Apr 10 '24 11:04 T8RIN