tesstrain icon indicating copy to clipboard operation
tesstrain copied to clipboard

Train Tesseract LSTM with make

Results 63 tesstrain issues
Sort by recently updated
recently updated
newest added

@zdenop I have come to the last hurdle! I can finish training the model successfully. I like to make the trained model from files in checkpoints dir from the instruction....

I would like to know how the [default model](https://github.com/tesseract-ocr/tessdata_best) is trained. If it is trained with several images (if so, what order of magnitude), or if images are generated automatically...

stale

Hi, I'm using last main branch and trying to train Japanese, prepared some text from news but it generated the following error If remove all Japanese Zenkaku characters, which means...

stale

I have prepared the following ground truth files: ``` ../tesstrain/data/Chechen-ground-truth |-- 1.box |-- 1.gt.txt |-- 1.png |-- 10.box |-- 10.gt.txt |-- 10.png |-- 11.box |-- 11.gt.txt |-- 11.png |-- 12.box...

stale

Hi there. Just want to share how I managed to run tesseract training with tesstrain on version 5. It might help other and I hope can be used to improve...

stale

Can tesstrain use in emoji? I have got error: ``` Encoding of string failed! Failure bytes: fffffff0 ffffff9f ffffff80 ffffff87 Can't encode transcription: '🀇' in language '' ```

Hello, I followed the training procedure, there I generated the `.gt.txt` and `.box` files for the line images with help of tesseract Then, I corrected/annotated the `.gt.txt` and `.box` files...

In general, the documentation provided in README.md is very vague, and doesn't explain the training parameters and their impact on the output model. Apart from the above, the information provided...

enhancement

Hi, Is there any configuration required to use the arabic handwriting traineddata, please? I found them here, https://ub-backup.bib.uni-mannheim.de/~stweil/ocrd-train/data/ArabicHandwritingOCRD/tessdata_best/ Should I download all of them or only the latest version?

question
stale

I'm trying to create language data as the instructions say, but I can't do it Is there a way to solve this problem? Thank you ![Screenshot](https://github.com/tesseract-ocr/tesstrain/assets/39112555/ed51ba7e-c6dd-4bc6-8419-ca3787423f81) (.venv) PS D:\ocr\tesstrain> make...