Robert Sachunsky
Robert Sachunsky
@stefanCCS as I explained [here](https://github.com/OCR-D/ocrd_anybaseocr/pull/89#issuecomment-1046333592), you have to set a different `resize_mode` and `resize_width` for better quality (resolution), esp. with such large images. Also see improved help text.
> btw: I get this warning here - relevant ? no, not relevant. (string vs int mapping of interpolation types does work.)
> * Result: One example works better, the other one gets "out-of-memory" with more than 8GB RAM to try to allocate Yes, it's memory-intensive for sure. That's by design, unfortunately....
> raise Exception("File with ID='%s' already exists" % ID) thanks for the report – can you please try again with newest #91?
> For us this is not a huge page, it is a normal one - we sometimes get A1 newspapers with 600dpi - this is huge ;-) I see. I...
> I would guess that the training has not seen this kind of strong warped examples. > Maybe something to be improved? Yes, that's exactly what I would assume. Unfortunately,...
> This brings me back to the idea, that ocrd-kraken-segment should learn "Regions-level". And with this lines, the dewarping should work ok. > What do you think? You mean the...
Understood. Another thing that this page or guide should mention is converters for page segmentation training data. With `ocrd-segment-from-masks` and `ocrd-segment-from-coco` we have 2 importers and with the debug images...
Can perhaps be closed – there's a section on the ocrd_segment converters in https://ocr-d.de/en/workflows#step-19-format-conversion now. (And page2img is independent of OCR-D _and_ most OCR tools: tesstrain will probably [include its...
> Hmm.. we are talking about https://ocr-d.de/en/workflows ? > > The "Recommendations" at the end of that page? Yes! > But in Step 5 cis-ocropy-deskew is mentioned ( but not...