Robert Sachunsky

Results 272 issues of Robert Sachunsky

By implementing #635 to properly handle all cases of PAGE-XML file matching per pageId, we have lost sight of the severe performance penalty that this comes with. In effect, we...

The Docker images for core currently do not conform to our [Docker specification](https://ocr-d.de/en/spec/docker#labelling-images), i.e. they have no labels advertising the `vcs-ref`, `build-date` and so on.

During `ocrd zip bag`, we are packaging remote and local FLocat references as we see them. But file references may be shared between fileGrps – e.g. the same URL or...

Sometimes physical page identifiers look like `phys_fcd565cd-d170-4774-bb37-9aa25b68370b` – which is impossible to type/memorize. But even names like `PHYS_0001` are not as helpful as the `@ORDERLABEL` and `@ORDER` attributes. Since presentation...

Naturally, with operations like deskewing/rotation and cropping the resulting image size and relative coordinates depend on the relative order of the operations. We already respect that by following the exact...

By using https://github.com/click-contrib/click-man and https://github.com/click-contrib/click-completion it should be fairly simple to install all OCR-D scripts in such a way that the shell automatically becomes syntax-sensitive to their options/subcommands and that...

I think this is caused by a change in assets: https://github.com/OCR-D/assets/commit/b12e5ebc12450bd70e9ec7a9d7afeb48f6201773, which was supposed to fix https://github.com/OCR-D/assets/issues/87, but does not work. Here is a debug log of what actually happens...

As an idea to make sharing knowledge about (combinations of) external resources easier: It would be great if `processor.resolve_resource` would not only search the locally installed resources, but also try...

enhancement

PAGE-XML uses the XMLSchema `ID` type for all structure segments, which precludes duplicates (makes such usage invalid). However, when doing segmentation on a `OcrdPage` instance, if some segments already exist...

Note: In METS, the labels are a flat sequence of `gt:state` elements with `@prop` from the above mentioned schema file, one per page. ```XML ``` These are then referenced under...