Results 35 issues of Amit Dovev

https://github.com/tesseract-ocr/tessdata/tree/3a94ddd47be0 @theraysmith , How to present those 'best' files to our users? https://github.com/tesseract-ocr/tesseract/wiki/Data-Files Do you plan to push more updates to the best directory and/or to the root dir in...

question

Here, i'm going to raise some issues related to Tesseract's Hebrew support. Dear participants who have interest in Arabic support, I suggest to raise Arabic issues in a separate 'issue',...

From https://github.com/tesseract-ocr/tesseract/issues/40 @stweil commented >Are there also new data files planned for old German (deu_frak)? I was surprised that the default English model with LSTM could recognize some words. @theraysmith...

Copied from 59: ----------------------------------------- @Shreeshrii commented ​ Just checking whether this new training will also address: 2. Correct handling of superscripts ----------------------------------------- @theraysmith commented 2. Correct handling of superscripts Beyond...

https://github.com/cneud/ocr-gt

From #82 @theraysmith commented >OK I have added desired/forbidden characters for heb and yid I assume that apart from the 3 unique characters that you listed (for each) the list...

Copied from 59 ------------------------------------------------ [reply to @Shreeshrii] @theraysmith commented TM is also difficult, as it is in conflict with the needs of fi/fl, which should not appear in the output.

Hi, What's the license of this project?

This will only parse `include/tesseract`. Maybe make this the default, including here: https://tesseract-ocr.github.io/tessapi/5.x/files.html

documentation

https://github.com/tesseract-ocr/tesseract/issues/518#issuecomment-277514434 >@stweil commented on 5 Feb 2017 > >There are different approaches possible to get support for big endian machines: > >1. Write training data files in native endian byte...

endianness
enhancement
priority: low