jochre
problems with columns
The OCR didn't manage to recognise the line breaks in a text with two columns.
See screenshot.
Thanks. We're working on the training corpus and software for Jochre 3, and we're including Zalman Reyzen's lexicon in the training corpus for segmentation. If you find any other badly segmented works, report them here and we'll include them in the training corpus.
Found one, coincidentally also a Reisen! But Avrom this time.
https://tinyurl.com/reisen-eybike-sheynhayt
It's the middle hit: a book with poems in two columns.
It goes wrong both in the search results and in the OCRed view of the text.
An example: https://www.yiddishbookcenter.org/collections/yiddish-books/spb-nybc200201?book-page=76&book-mode=1up
Found another, also from Reisen's leksikon (leksikonfund00rejz).