tesstrain
tesstrain copied to clipboard
Training for fonts?
I've seen some outdated docs for training tesseract with specific fonts.
What's the current way of doing this? Pointers would be appreciated :)
You can try the scripts in https://github.com/Shreeshrii/tess5train-fonts
These work for me. Please be prepared to modify the bash scripts as needed.
If you want to fine tune a model for a specific font using artificial data, the scripts in https://github.com/tesseract-ocr/tesstrain/tree/main/src/training mostly are a drop-in replacement for the old tesstrains.h script.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.