tesseract icon indicating copy to clipboard operation
tesseract copied to clipboard

Tesseract Open Source OCR Engine (main repository)

Results 218 tesseract issues
Sort by recently updated
recently updated
newest added

This will only parse `include/tesseract`. Maybe make this the default, including here: https://tesseract-ocr.github.io/tessapi/5.x/files.html

documentation

https://github.com/tesseract-ocr/tesseract/issues/518#issuecomment-277514434 >@stweil commented on 5 Feb 2017 > >There are different approaches possible to get support for big endian machines: > >1. Write training data files in native endian byte...

endianness
enhancement
priority: low

leptonica moved most of their type definitions to internal headers. Long term usages of internal API should be removed, but for now simply include the internal header to allow tesseract...

The unittests need some lstmf files which are there in the repo https://github.com/tesseract-ocr/test/tree/master/testdata, which is used as a submodule in tesseract. The unittests run ok on ppc64le (lttle endian) but...

bug
training
endianness
unit tests
priority: low

I had to move [my previous pull-request](https://github.com/tesseract-ocr/tesseract/pull/3599) (my solution to https://github.com/tesseract-ocr/tesseract/issues/3590) to a new branch. I couldn't do that without making this new pull-request.

https://github.com/tesseract-ocr/tesseract/issues/3871

While building the tesseract library using CMake and Xcode , it throws error with undefined symbol for x86_64 Undefined symbols for architecture x86_64: "tesseract::DotProductNEON(float const*, float const*, int)", referenced from:...

build process

I want to create tiff images from 55 fonts using this code: ``` rm -rf train/* tesstrain.sh --fonts_dir font \ --lang fas \ --noextract_font_properties --linedata_only \ --langdata_dir langdata_lstm\ --tessdata_dir tesseract/tessdata...

training
text2image

### Environment * **Tesseract Version**: tesseract 4.1.1 leptonica-1.79.0 * **Commit Number**: installed through ` apt install tesseract-ocr` * **Platform**: Linux DESKTOP-xxxxxxx 5.10.102.1-microsoft-standard-WSL2 (Ubuntu 20.04) ### Current Behavior: Tesseract fails to...

multilingual ocr

### Environment * **Tesseract Version**: tesseract 5.2.0 leptonica-1.82.0 libgif 5.2.1 : libjpeg 8d (libjpeg-turbo 2.1.3) : libpng 1.6.37 : libtiff 4.4.0 : zlib 1.2.11 : libwebp 1.2.4 : libopenjp2 2.5.0...

legacy