core
core copied to clipboard
Collection of OCR-related python tools and wrappers from @OCR-D
clobber_mets = False does not raise an exception for existing mets.xml ('raise'), but fails silently ('skip') https://github.com/OCR-D/core/blob/4e11b8e70a2426f3d25466f462db13bf31215919/ocrd/ocrd/resolver.py#L114 see: https://github.com/OCR-D/core/blob/4e11b8e70a2426f3d25466f462db13bf31215919/ocrd/ocrd/resolver.py#L154 I don't know what would be the right thing to do:...
* New class `OcrdMetsFilter` in `ocrd_models` that represents restrictions on files (include/exclude by fileGrp, mimetype currently) * `ocrd workspace clone` supports * `--fileGrp-include` * `--fileGrp-exclude` * `--mimetype-include` * `--mimetype-exclude` Proposed...
* ValidationReport -> ocrd_models * Decouple parsing and validation of tasks into OcrdWf (workflow), OcrdWfStep and OcrdWfValidator
Load the parameters from the ENV, if they are defined to provide an easier handling.
Now that we have `prune-files` on the one hand and `bulk-add` on the other, how about a new command that lists those files in the workspace directory which are **not**...
As I suggested on the call on software development best practices (7 jul 2020), it might be interesting to release the OCR-D modules also as [conda](https://docs.conda.io/en/latest/) packages. Since the application...
I am trying to fix bagit checksums in OCR-D/assets#64 with `ocrd zip bag -Z -I`: - When I use `-d data`, then the bagger will move everything to `data/data`. -...
IMO we currently have an unfortunate situation regarding support for modern image formats: (In the following, I will refer to Ubuntu 18, because it is our default target for Travis/CircleCI...