tesseract icon indicating copy to clipboard operation
tesseract copied to clipboard

issue of reading -1 as 4 on images

Open Hussin22 opened this issue 2 years ago • 4 comments

hello Tesseract ,

i do ocr reading with java using tesseract library , but only an issue with hyphen always '-1' readed as '4' , below the readed example image : https://i.ibb.co/vxDrv2y/PN.png

this readed as : 23267764.

Hussin22 avatar Jan 26 '23 08:01 Hussin22

i m using java with tess4J 3.4.8 and eng.traineddata download from github

i need to read only numbers from 0-9 , and also '-' , 'L'

Hussin22 avatar Jan 26 '23 08:01 Hussin22

Please respect guidance for posting issues - we do not provide support for 3rd party tools. Replicate problem with tesseract executable and provide necessary information (tesseract version, which trainneddata you used...)

zdenop avatar Jan 26 '23 09:01 zdenop

i use this below version : <groupId>net.sourceforge.tess4j</groupId> <artifactId>tess4j</artifactId> 5.5.0

and for trained data i use : eng.traineddata

Hussin22 avatar Jan 26 '23 11:01 Hussin22

One again - we do not support tess4j (3rd party solution) - replicate problem with recent tesseract executable.

zdenop avatar Jan 26 '23 12:01 zdenop