tesstrain
Train Tesseract LSTM with make
Hi team, I'm training a model on a font with `START_MODEL=eng`, and while the resulting .traineddata correctly recognizes a lot of things in that font, there are some which...
I tried to recognize some old Fraktur texts with deu_latf, but there are many words that are not recognized correctly, so I extracted the word list from deu_latf. This file...
I tried to train Tesseract 5 with a new Thai font, but the BCER value keeps increasing. Here are the details: Font: TH Sarabun New (200 samples) Base...
This is the screenshot from jTessBoxEditor: [image] The provided [example training files](https://github.com/tesseract-ocr/tesstrain/blob/main/ocrd-testset.zip) in this repo seem to be built from whole-line image and text pairs rather than character by character. Then my...
When compiling Tesseract from source on **CentOS**, three dependent packages are reported as missing. However, these three packages have been installed and their versions fully meet the requirements. This...
When trying to fine-tune a model, I get "Failed to read data" errors and then an "assert failed" error ``` C:\Users\tobik\source\repos\tesstrain>make training MODEL_NAME=ocrd-testset START_MODEL=ces TESSDATA=C:\tessdata You are using make version: 4.4.1...
I'm working with `tesseract-4.1.1` and trying to do training (fine-tuning). For this I have followed these steps: 1. Downloaded `eng.traineddata` from `tessdata_best` and pasted it into `/usr/share/tesseract-ocr/4.00/tessdata`. 2. Then I've created...
Hi, I followed the tutorial and extracted the content of `ocrd-testset.zip` into `./data/foo-ground-truth`, but when I run: > make training I get the following output: ``` (python) manuelarte@maclaptop tesstrain %...
I have a dataset on which the model is not performing well. The dataset is at word level. Can I train the Tesseract model on this same dataset?
I am working on training a model for the Grantha script. What I figured out is that both Tamil and Malayalam have almost the same characters and formatting as Grantha. So, I...