tesstrain
tesstrain copied to clipboard
Train Tesseract LSTM with make
plot_cer.sh file is not created after make training so how to extract plot_cer.csv file from TESSTRAIN.LOG
how to prepare the data for new tessdata images in khmer
Is this still the case: https://groups.google.com/g/tesseract-ocr/c/AnMYS98VwiE/m/1PN3mF6PAgAJ The MAX_ITERATIONS depends on the number lstmf files? If I have 1 millions pairs of images and text ground truth for training from scratch,...
This is my attempt to migrate the existing code for working with artificial training data to a dedicated Python package, as proposed in #308 and #307. This includes some additional...
I am trying to use the tool and just run the tutorial setup. However when running `make training` i get an error ``` Failed to read boxes from data/foo-ground-truth/alexis_ruhe01_1852_0018_022.tif ```...
Not sure if this is related to #53: why does the current default `NORM_MODE` set 2 for non-Indic, non-RTL languages? Shouldn't this be 1? Also, the decision tree looks quite...
(because training fails if a .unicharset has already been created previously, but for a different START_MODEL)
Dear All, I am trying to train the tesseract with new font ("B Nazanin" attached to the issue) here is my steps, and I am using the `langdata_lstm` git and...
Hi. I'm trying to take the jpn_vert traineddata, and further train it with my own images. My command to run it looks like this: ``` make training MODEL_NAME=my-custom-model START_MODEL=jpn_vert TESSDATA=$TESSDATA...
The Python approach in *src/training* requires additional effort to be used from within own Python code, as there is no PyPI package for it. It would be great to make...