Robert Sachunsky
Robert Sachunsky
> The word "Wiener" is just a snippet from a 1238 × 388 px image, but I now realize that I forgot to set the DPI. Wow! They rarely come...
> from here: https://sourceforge.net/p/localcontrast/code/ci/default/tree/ctmf.c , line 362. > > 7/1 = white/black ratio when using "Stempel Garamond" Font (black on white) with reasonable leading. This is what I am used...
> Some interesting binarization with much more equations someone else pointed me to: https://arxiv.org/pdf/2007.07350.pdf Yes, we've briefly discussed that in the Lobby. [Here](https://github.com/jonbarron/hist_thresh) is the implementation. Unfortunately does not combine...
> So I think we need a different artificial test bed. As a first step, @jbarth-ubhd could you please change your code to do each point in your matrix (i.e....
> Yes, this is was I've done. Process the "full" 1238 × 388 px image (from PDF, 300 DPI, DIN A7) and extract the word "Wiener" for compact comparison. Oh,...
> Feel free to do anything with it. Sorry no docs. "gen" generates various bXXXwXXXnX.ppm from Beethoven-testtext.pgm downsampling 25%. Sorry width+hight hard coded in gen.c++. Convert it to .tif ....
Would be interesting to see how `sbb-binarize` copes with normalized and with cropped images. But the message is already clear: Good neural modelling is superior. > I think show-through can't...
> Just for completeness: ocropus-nlbin with `-n`; not normalized before: Should be the same as `ocrd-cis-ocropy-binarize`, right?
@jbarth-ubhd I'm not sure what you want to say with this. But here's a comparison of both wrappers for old ocropus-nlbin: | anybaseocr-binarize | cis-ocropy-binarize | | --- | ---...
> > Should be the same as ocrd-cis-ocropy-binarize, right? > > Didn't know if orcropus-nlbin is the same as cis-ocropy-binarize, so tried to find out and found some lines that...