tessdata icon indicating copy to clipboard operation
tessdata copied to clipboard

Tesseract 4 failed to recognise some words in tabular data

Open engahmed1190 opened this issue 5 years ago • 0 comments

I suspect this is a bug from the trained data. I would like to have suggestions if possible to solve this


Environment

  • Tesseract Version: tesseract 4.0.0-beta.1-306-g45b11cd
  • Platform: Ubuntu 16.04.4 LTS

I have this image I want to detect all the text inside it.

0

The detected text missing important words as PO and some values. 0.pdf

I am interested to know how to overcome this effect.

I am using this traineddata : https://github.com/tesseract-ocr/tessdata/blob/master/script/Latin.traineddata Thanks

engahmed1190 avatar Jul 28 '18 16:07 engahmed1190