mhmadaladin comments

Results 11 comments of


                                            mhmadaladin

App Crash when highlight a word on Arabic pdf

Yes, I installed the Arabic Tesseract trained data with all complimentary files from [https://github.com/tesseract-ocr/tessdata/tree/3.04.00](url) as mentioned in the user guide, and it worked normally on other device as mentioned before...

App Crash when highlight a word on Arabic pdf

> > Yes, I installed the Arabic Tesseract trained data with all complimentary files from https://github.com/tesseract-ocr/tessdata/tree/3.04.00 as mentioned in the user guide, and it worked normally on other device as...

App Crash when highlight a word on Arabic pdf

> Have you tried the 3 variants (`tessdata`, `tessdata-best`, `tessdata-fast`)? No, only one, I'll try the other two when I return from Work, thank you for your support

App Crash when highlight a word on Arabic pdf

> I must say I always had a bad experience highlight/lookup'ing in scanned PDF (including in mine made of book page photos made at public libraries, just concatenating the JPG...

App Crash when highlight a word on Arabic pdf

> #12481 I can't get Forced OCR to work even with the correct data copied to that folder. I don't know about Android version, but it worked on my kindle,...

App Crash when highlight a word on Arabic pdf

> > > #12481 I can't get Forced OCR to work even with the correct data copied to that folder. > > > > > > I don't know about...

App Crash when highlight a word on Arabic pdf

> Force OCR means "ignore the embedded text layer because it's beyond atrocious". You presumably rarely want to turn that on at all, and certainly not by default. Yes that's...

App Crash when highlight a word on Arabic pdf

> If there's no text layer OCR is always performed. Force OCR only refers to ignoring the text layer (i.e., forcing OCR even though it's not necessary). > > Is...

App Crash when highlight a word on Arabic pdf

> The recent Tesseract upgrade has changed its behavior a bit in a way we don't (hopefully not can't) deal with yet. The older version used to nearly always return...

App Crash when highlight a word on Arabic pdf

> I can confirm that most words don't seem to be recognized, even with the best model. The issue with Arabic support seems to be known (Cf. [tesseract-ocr/tesseract#2047](https://github.com/tesseract-ocr/tesseract/issues/2047)). > >...