tessdata_best icon indicating copy to clipboard operation
tessdata_best copied to clipboard

Polytonic characters on Greek trained data files

Open kchrs opened this issue 6 years ago • 0 comments
trafficstars

It looks like that although ell.traineddata should contain only modern Greek characters after OCR-ing multiple tiff files written in modern Greek output text contains (old) polytonic characters eg: "εἶναι " This is old style Greek. I thought there is another language file for ancient Greek (grc.traineddata) and not the ell.traineddata one. Polytonic writing should be removed

kchrs avatar Dec 13 '18 11:12 kchrs