tesstrain issues

./plot/plot_cer.sh is missing!

2

plot_cer.sh file is not created after make training so how to extract plot_cer.csv file from TESSTRAIN.LOG

MeilyOeng

how to prepare the data for new tessdata images in khmer lang

1

how to prepare the data for new tessdata images in khmer

mengleang-ngoun

number of MAX_ITERATIONS

3

Is this still the case: https://groups.google.com/g/tesseract-ocr/c/AnMYS98VwiE/m/1PN3mF6PAgAJ The MAX_ITERATIONS depends on the number lstmf files? If I have 1 millions pairs of images and text ground truth for training from scratch,...

whisere

question

Migrate Python code to a dedicated package

10

This is my attempt to migrate the existing code for working with artificial training data to a dedicated Python package, as proposed in #308 and #307. This includes some additional...

stefan6419846

Failed to read boxes

8

I am trying to use the tool and just run the tutorial setup. However when running `make training` i get an error ``` Failed to read boxes from data/foo-ground-truth/alexis_ruhe01_1852_0018_022.tif ```...

NoxideLive

bug

use norm_mode 1 as default

9

Not sure if this is related to #53: why does the current default `NORM_MODE` set 2 for non-Indic, non-RTL languages? Shouldn't this be 1? Also, the decision tree looks quite...

bertsky

pinned

explicate .lstm-unicharset and my.unicharset prereqs for finetuning

17

(because training fails if a .unicharset has already been created previously, but for a different START_MODEL)

bertsky

pinned

training failed for persian language with new font

22

Dear All, I am trying to train the tesseract with new font ("B Nazanin" attached to the issue) here is my steps, and I am using the `langdata_lstm` git and...

mohsenomidi

Missing config in new created traineddata

1

Hi. I'm trying to take the jpn_vert traineddata, and further train it with my own images. My command to run it looks like this: ``` make training MODEL_NAME=my-custom-model START_MODEL=jpn_vert TESSDATA=$TESSDATA...

MPQC

enhancement

Python package for tesstrain.py

The Python approach in *src/training* requires additional effort to be used from within own Python code, as there is no PyPI package for it. It would be great to make...

stefan6419846

enhancement

tesstrain
tesstrain copied to clipboard

Metadata

./plot/plot_cer.sh is missing!

how to prepare the data for new tessdata images in khmer lang

number of MAX_ITERATIONS

Migrate Python code to a dedicated package

Failed to read boxes

use norm_mode 1 as default

explicate .lstm-unicharset and my.unicharset prereqs for finetuning

training failed for persian language with new font

Missing config in new created traineddata

Python package for tesstrain.py

← Metadata

Owner

Metadata

tesstrain tesstrain copied to clipboard

Metadata

← Metadata

Owner

Metadata

tesstrain
tesstrain copied to clipboard