Robert Sachunsky comments

Results 735 comments of


                                            Robert Sachunsky

frak models in ocrd resmgr

> Those additional components are based on the components from a Tesseract standard model (as far as I remember on `Fraktur.traineddata`, but I'd have to check) No, the latter word...

frak models in ocrd resmgr

> Of course it would be preferable to have a standard dictionary for (say) 18th century German. We could export the fullforms from DTA lexdb, for example. (But this must...

frak models in ocrd resmgr

> * >10: 314248 words > * >50: 100516 words > * >100: 60403 words > I will try to use this with frak2021, but also GT4HistOCR and others. Done:...

frak models in ocrd resmgr

> In my tests frak2021 is much better than GT4HistOCR, so using it with GT4HistOCR might not be worth the efforts. > It would be more interesting to use it...

frak models in ocrd resmgr

Indeed – something went wrong. Thanks @jbarth-ubhd, I'll investigate!

Ok, I found the problem. See [new release](https://github.com/bertsky/dta-lexdb-applications/releases/tag/v0.2). ``` 346632 lines 16.37 % lines with »ſ« 0.19 % lines all-UPPERCASE 132.80 % lines ambigious ``` What's with the > 100%...

frak models in ocrd resmgr

> **a lot of spaces** after words(?). wow, I should have checked. Thanks again for being thorough @jbarth-ubhd – much appreciated! see [new release](https://github.com/bertsky/dta-lexdb-applications/releases/tag/v0.3) > And not NFC (double counting,...