tesstrain icon indicating copy to clipboard operation
tesstrain copied to clipboard

Train Tesseract LSTM with make

Results 63 tesstrain issues
Sort by recently updated
recently updated
newest added

I've seen some outdated docs for training tesseract with specific fonts. What's the current way of doing this? Pointers would be appreciated :)

question
stale

Arch Linux, ``` tesseract 5.0.0-alpha-20210401-158-ge1761 leptonica-1.81.0 libgif 5.2.1 : libjpeg 8d (libjpeg-turbo 2.1.0) : libpng 1.6.37 : libtiff 4.3.0 : zlib 1.2.11 : libwebp 1.2.0 Found AVX2 Found AVX Found...

bug

I have used the `tesstrain.sh` approach (including `tesstrain_utils.sh` and `language-specific.sh`) for fine tuning an existing model for a specific font in the past. As this is deprecated with the corresponding...

The script works for line level images. I have a number of scanned page images with ground truth files. Does OCR-D project have any tools to segment it to line...

enhancement

Similar to [@stweil's training for Fraktur](https://github.com/tesseract-ocr/tesstrain/issues/73), I am collecting here info regarding finetune RTL training with [OCR_GS_Data for Arabic](https://github.com/OpenITI/OCR_GS_Data/tree/master/ara). Some of this has already been reported [elsewhere in other threads](https://github.com/tesseract-ocr/tesseract/issues/2669#issuecomment-559106006)...

pinned

Hello! I wanted to ask if it would be possible to train Tesseract to recognize the handwriting of a person. I have a collection of old handwritten letters by one...

question
pinned

Porting from https://github.com/tesseract-ocr/tesseract/pull/3434 (not merged) . >This Pull Request adds --vertical_fontlist option to tesstrain.sh to specify a list of fontnames to render vertical text. >The format for specifying a list...

enhancement

By default tesstrain builds vanilla tesseract / lstmtraining, which IINM links against OpenMP. I know @stweil argued repeatedly for disabling OpenMP for prediction in the mass production / batch scenario,...

pinned

My system info: - OS: Ubuntu Desktop 18.04 LTS (4.15.0-55-generic) Hi. I am beginner and am trying to train some Korean character images for Korean recognition. To understand how to...

pinned

Include generation of Trainingdata Sets from OCR like ALTO V3, PAGE 2013, PAGE 2019 and Image Files (tif, jpeg)

pinned