tessdata_best
tessdata_best copied to clipboard
Polytonic characters on Greek trained data files
trafficstars
It looks like that although ell.traineddata should contain only modern Greek characters after OCR-ing multiple tiff files written in modern Greek output text contains (old) polytonic characters eg: "εἶναι " This is old style Greek. I thought there is another language file for ancient Greek (grc.traineddata) and not the ell.traineddata one. Polytonic writing should be removed