mhmadaladin
mhmadaladin
Yes, I installed the Arabic Tesseract trained data with all complimentary files from [https://github.com/tesseract-ocr/tessdata/tree/3.04.00](url) as mentioned in the user guide, and it worked normally on other device as mentioned before...
> > Yes, I installed the Arabic Tesseract trained data with all complimentary files from https://github.com/tesseract-ocr/tessdata/tree/3.04.00 as mentioned in the user guide, and it worked normally on other device as...
> Have you tried the 3 variants (`tessdata`, `tessdata-best`, `tessdata-fast`)? No, only one, I'll try the other two when I return from Work, thank you for your support
> I must say I always had a bad experience highlight/lookup'ing in scanned PDF (including in mine made of book page photos made at public libraries, just concatenating the JPG...
> #12481 I can't get Forced OCR to work even with the correct data copied to that folder. I don't know about Android version, but it worked on my kindle,...
> > > #12481 I can't get Forced OCR to work even with the correct data copied to that folder. > > > > > > I don't know about...
> Force OCR means "ignore the embedded text layer because it's beyond atrocious". You presumably rarely want to turn that on at all, and certainly not by default. Yes that's...
> If there's no text layer OCR is always performed. Force OCR only refers to ignoring the text layer (i.e., forcing OCR even though it's not necessary). > > Is...
> The recent Tesseract upgrade has changed its behavior a bit in a way we don't (hopefully not can't) deal with yet. The older version used to nearly always return...
> I can confirm that most words don't seem to be recognized, even with the best model. The issue with Arabic support seems to be known (Cf. [tesseract-ocr/tesseract#2047](https://github.com/tesseract-ocr/tesseract/issues/2047)). > >...