Uwe Hartwig
Uwe Hartwig
I've been talking with https://github.com/galdring , a colleague, about this review and he's out to get us somebody.
There's also a branch with the same name (`feat/generate-trainingsets`) but outdated already in this repository, which I guess @kba created to commit his extensions before I integrated them and they...
I already have OCR-ALTO from Tesseract together with PDF-Files that currently lack textlayers, although they have a table of contents and some metadata (which I'd like also to modify, and...
@Shreeshrii I wonder if/how the ordering of punctuation chars affects training. Given a line image like https://github.com/OpenITI/OCR_GS_Data/blob/master/ara/book_IbnFaqihHamadhani.Buldan/7_final_b/a_000716.png, compared with it's transcription (https://github.com/OpenITI/OCR_GS_Data/blob/master/ara/book_IbnFaqihHamadhani.Buldan/7_final/a_000004.gt.txt), it seems to me the double colon is...
Maybe currently it's convenient to eliminate punctuation from training data? Our focus is on letters. The PR-Request https://github.com/tesseract-ocr/tesstrain/pull/205 tries to sanitize this by wiping off any RTL-unicode direction marks, which...
Hello! I managed a similar case by using custom validations: ``` let opts = { custom: { my_select_valiadator: function ($el) { if($el.val() == "" ) { return "please select something!";...
I'll try to do so. Actually I'm somewhat puzzled, since the text concerning this issue on https://github.com/funkyfuture/deck-chores/blob/main/docs/usage.rst differs from what is published as official doc at https://deck-chores.readthedocs.io/en/stable/usage.html#job-definitions. Which one shall...
The Link from send.firefox.com is active for 1 day. Afterwards it will disappear.
@zdenop We're scanning from Microfilms using QuantumScan Software and do Preprocessing with QuantumProcess which does a good job with constrasts and deskewing. Therefore I wonder why 999 Images pass, but...
@zdenop Yes, many thanks, this works! But anyway, I wonder why all the other images went fine. They are all born the same way. Never cropped, just plain TIF-files with...