jbreiden
jbreiden
> Do you have the files to update this repo for 4.0.0? No, I don't. But I have been (and continue to) look into this.
I try hard to make sure Arabic and other right-to-left languages work correctly in Tesseract PDF. As the problem is isolated further I'm happy to look, but I'm not aware...
A quick check shows Chrome gives good results (as per amitdo) and Acroread gives bad results (as per tbadran). This is surprising, I thought we were good with Acroread. I...
Regarding recognition accuracy, that's a better topic for the forum. But in short: Don't compare against Google Drive. Don't expect major accuracy improvements unless/until Ray is successful with his ideas....
I find it a little easier to test with Hebrew because the letters do not connect. Tesseract version 3.03 behaves the same, so this is not a regression. Will need...
There are two things I can think of doing. One is to give up and write Arabic backwards (which I really hate!). The other is to put an entry in...
@amitdo Hebrew has the exact same problem as Arabic.
That's another possibility, thanks for the suggestion.
I am taking a look at this today. With current code, copy-paste works from Chrome, fails from Adobe Reader. Destination is gEdit. All tests are on Linux. I see no...
Interesting suggestion. If correct, why would it show up as an n - 1 problem in highlighting?