Robert Sachunsky

Results: 272 issues by Robert Sachunsky

In #278 we already had the case where the METS got broken due to ctrl+C in a single process. We discussed parallelization but agreed to defer that topic. Now...

enhancement

Although the spec was changed long ago so that a processor is no longer required to have an `output_file_grp`, core still enforces its defaults, both in the decorators… https://github.com/OCR-D/core/blob/6ca42f1614fbcaa07801d28d890d104223eb5aa3/ocrd/ocrd/decorators/ocrd_cli_options.py#L25 …and in the Processor...

bug

Using any of core's API (even just `ocrd_utils`) now requires calling `initLogging` at some point prior to usage, or the respective subpackage's `getLogger` will complain with `CRITICAL`. So not only...

Currently with workspaces we can either keep images on the remote side by using http URLs in `mets:file/mets:FLocat/@xlink:href` (which means they have to be downloaded again and again during processing),...

enhancement
question

The most recent generateDS PAGE-XML model now contains validation of type restrictions, which is laudable. But these messages are aggregated in a way which makes diagnosing just where the error...

We often have lots of useful documentation for processors which the ocrd-tool.json does not and cannot cover:
- README file
- [DITA](https://ocr-d.de/en/dita.html) files
- other documentation (publications, research notes, markdown...

enhancement
documentation

Currently, any information on image resolution provided in the original image (and made available via `OcrdExif` in `Workspace.image_from_page`) is ignored when saving derived images in the workspace (via `Workspace.save_image_file`). Due...

bug
enhancement

Now that we have `prune-files` on the one hand and `bulk-add` on the other, how about a new command that lists those files in the workspace directory which are **not**...

enhancement

I am trying to fix bagit checksums in OCR-D/assets#64 with `ocrd zip bag -Z -I`:
- When I use `-d data`, then the bagger will move everything to `data/data`.
- ...

bug

IMO we currently have an unfortunate situation regarding support for modern image formats: (In the following, I will refer to Ubuntu 18, because it is our default target for Travis/CircleCI...

enhancement