Amit Dovev comments

Results: 538 comments by Amit Dovev

Geresh and Gershayim are not included

I think 'desired_words' and 'forbidden_words' can also be used.

Geresh and Gershayim are not included

True.

Geresh and Gershayim are not included

https://github.com/tesseract-ocr/tessdata/issues/62#issuecomment-319839971
theraysmith commented on Aug 3, 2017:
>FYI: The wordlists are generated files, so it isn't a good idea to modify them, as the modifications will likely get overwritten in...

Geresh and Gershayim are not included

vie has an 'alphabet' file: https://github.com/tesseract-ocr/langdata/blob/master/vie/alphabet

Add vulgar fraction for 1/2

Pango, which is what we use to render the images with text2image, supports MathML.

Add vulgar fraction for 1/2

>Now we only need a Tesseract which can detect formulae in images
https://github.com/tesseract-ocr/tesseract/blob/master/ccmain/equationdetect.h

Add Filipino lang

https://github.com/tesseract-ocr/tessdata/raw/master/best/fil.traineddata

Add Filipino lang

>I've tried adding it to the language folders but when selecting fil as language the app always shut down.
You should try running Tesseract from the command line.
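A command-line run makes any error about loading the language data visible instead of the app silently closing. The sketch below is only an illustration of that suggestion: it assumes the `tesseract` executable is on PATH, and the helper names, the image path, and the tessdata directory are hypothetical.

```python
import shutil
import subprocess


def build_tesseract_cmd(image_path, lang="fil", tessdata_dir=None):
    """Return the argv list for an OCR run that prints recognized text to stdout."""
    cmd = ["tesseract", image_path, "stdout", "-l", lang]
    if tessdata_dir is not None:
        # Directory that holds <lang>.traineddata (hypothetical location).
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


def run_tesseract(image_path, lang="fil", tessdata_dir=None):
    """Run Tesseract and return (exit code, stdout, stderr)."""
    if shutil.which("tesseract") is None:
        raise FileNotFoundError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_tesseract_cmd(image_path, lang, tessdata_dir),
        capture_output=True,
        text=True,
    )
    # Problems loading the traineddata file are reported on stderr.
    return result.returncode, result.stdout, result.stderr
```

If the language file cannot be loaded, the message in the returned stderr is the useful thing to quote in the issue.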

Improve yor.traineddata for Yoruba

Making screenshots is not very useful. You need the text itself. A web crawler is what you need to use. Please list the URLs of those two sites. Did you...

Improve yor.traineddata for Yoruba

The images for trained data are created by the text2image tool. It renders images from text files using a variety of digital fonts.
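As a rough sketch of that rendering step, the helpers below build one text2image command per font. The flags `--text`, `--outputbase`, `--font` and `--fonts_dir` are text2image options; the helper names, the file names, the font list, and the `<lang>.<font>.exp0` output naming are assumptions made for the illustration.

```python
def build_text2image_cmd(text_path, output_base, font, fonts_dir):
    """Return the argv list that renders one training image for one font."""
    return [
        "text2image",
        f"--text={text_path}",
        f"--outputbase={output_base}",
        f"--font={font}",
        f"--fonts_dir={fonts_dir}",
    ]


def build_cmds_for_fonts(text_path, lang, fonts, fonts_dir):
    """Build one command per font, naming outputs <lang>.<font>.exp0 (assumed convention)."""
    cmds = []
    for font in fonts:
        output_base = f"{lang}.{font.replace(' ', '_')}.exp0"
        cmds.append(build_text2image_cmd(text_path, output_base, font, fonts_dir))
    return cmds


if __name__ == "__main__":
    # Hypothetical inputs: a Yoruba training text and two installed fonts.
    for cmd in build_cmds_for_fonts(
        "yor.training_text", "yor", ["Arial", "Times New Roman"], "/usr/share/fonts"
    ):
        print(" ".join(cmd))
```

Each command can then be passed to `subprocess.run` on a machine where the Tesseract training tools are installed.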