tessdata_best icon indicating copy to clipboard operation
tessdata_best copied to clipboard

Best (most accurate) trained LSTM models.

Results 32 tessdata_best issues
Sort by recently updated
recently updated
newest added
trafficstars

I'd like to upload the errornously recognized figures here But it seems github cannot show it directly, so I've uploaded it to a repositoary. The errornously recognized file can be...

While the files in the top directory seem to come from the sources in the [langdata](https://github.com/tesseract-ocr/langdata) repository, the source for some of the files in `scripts/` is unclear: * `scripts/Fraktur.traineddata`...

When the language is set to "hun" (Hungarian), Tesseract is unable to recognize the % sign. This sign is very commonly used in Hungarian to represent percentages, the same way...

bug

If understand correctly, traineddata files that starts with a capital letter are "mixed languages" traineddata (e.g Hebrew = heb+eng). Was is produced by combining "heb" and "eng" traineddata files or...

It looks like that although ell.traineddata should contain only modern Greek characters after OCR-ing multiple tiff files written in modern Greek output text contains (old) polytonic characters eg: "εἶναι "...

### Environment Tesseract Version: Tesseract 32 bits version: tesseract v4.0.0-beta.1.20180608 leptonica-1.75.3 libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1...

See related issue in langdata. https://github.com/tesseract-ocr/langdata/issues/116

Hi, I created a test text data mostly (made up individual characters. see attachment) and converted it to tiff file using 'jTessBoxEditorFX' with font 'noto sans telugu 8pt'. I then...

I'm using tesseract for reading mrz codes and sometimes it gives me incorrect symbols eg. instead of "I" it gives me "1" or instead of "5" it gives "S"