tesseract icon indicating copy to clipboard operation
tesseract copied to clipboard

Tesseract Open Source OCR Engine (main repository)

Results 218 tesseract issues
Sort by recently updated
recently updated
newest added

In example.zip, 7 tif file in it. I merge all tif to zq.newbox.exp0.tif and generate zq.newbox.exp0.box. Then I execute this command in windows CMD, I want to generate lstm file...

training

Make sure that the empty vector's data is never used, `at()` will throw an exception in that case. This may allow compilers to better optimize code.

Tesseract currently has problems when the path of the executable contains Unicode characters which are not supported by the current code page. I also expect problems for any filenames given...

feature request
unicode

### Environment * **Tesseract Version**: Tesseract 4.1.1 * **Platform**: Win10 64bit, VS2017, MFC C++ application ### Current Behavior: **tesseract::TessBaseAPI::ProcessPages** cannot be stopped on demand. I didn't discover any way to...

feature request

As noted in the [documentation ](https://tesseract-ocr.github.io/tessdoc/ImproveQuality.html#rotation--deskewing), Tesseract performs poorly when the page is at an angle (not a multiple of 90 degrees). This limitation is not problematic from an accuracy...

feature request

Hi By Using Tesseract OCR to recognize text from image is works normal but for some time execution of following line reaches 450-500 ms with 3-5 char detection . char*...

performance

Hello, I am trying to train form scratch/fine tune tesseract for "Jameel Noori Nastaleeq" font for Urdu. The steps i did for training from scratch: 1. Create unicharset from all...

training
encoding failed

In Devanagari script, a virama is used to kill the inherent vowel of a consonant. When followed by another consonant, it forms a conjunct form. Depending on the font used,...

training
traineddata
encoding failed

### Environment * **Tesseract Version**: tesseract 5.0.0-alpha-20210401-123-g5eb2e8 * **Commit Number**: * **Platform**: either linux ubuntu 18.4 ### Current Behavior: Training command : ./tesstrain.sh --fonts_dir fonts --fontlist 'B Nazanin' --lang fas...

bug
training
encoding failed