Robert Sachunsky

Results 735 comments of Robert Sachunsky

> How up-to-date is that statement? Not up to date at all. And with https://github.com/OCR-D/ocrd_all/pull/362 it does in fact work with all modules. But we can simply replace 3.8 in...

That particular tail is devilishly long: If you want to build scikit-geometry from source, you need a system library called `libcgal-dev` – but at least version 5 [IIUC](https://github.com/scikit-geometry/scikit-geometry/issues/60). However, installing...

> I really should trash requirements/* in favor of one central conda-based `environment.yml`, which then relies on https://anaconda.org/conda-forge/scikit-geometry for scikit. Please don't remove these lists without some other mechanism for...

Sry, did not see this earlier. But I had the exact same use case. It's not so difficult to properly handle PAGE reading order in XSLT 1.0. This was solved...

> I am not sure that we need git submodules with which a lot of people also struggle to use. Moreover, this would only bee a partial update mechanism because...

No need for any of this, entirely, since we have been using https://github.com/kba/page-to-alto for this purpose instead since https://github.com/UB-Mannheim/ocr-fileformat/pull/134. I suggest closing (cannot do it myself).

BTW the existing integration of GCV as part of the PRImA converter (transform `gcv page` linking to `alto page`) is **broken**: it delegates to `java -jar PageConverter.jar -source-xml $INFILE` instead...

> So it was broken right from the beginning (commit [7332869](https://github.com/UB-Mannheim/ocr-fileformat/commit/73328691c466057566db62d8cdbea8b26823bdbb)). I'm not sure. Perhaps the PRImA convert was capable of detecting the format automatically before. But it does not...

> I tried it with fixed arguments, and it fails: I know. That's because in this example, the input data is incomplete. See [here](https://gitter.im/OCR-D/Lobby?at=63765d2d655bc46025cfb9fe)