core icon indicating copy to clipboard operation
core copied to clipboard

Collection of OCR-related python tools and wrappers from @OCR-D

Results 215 core issues
Sort by recently updated
recently updated
newest added
trafficstars

First we need to have a test for `--overwrite` that fails…

Hello, please add DPI-Parameters to enable to enforce custom resolution when using tesseract ## Tesseract CLI `--dpi 470` ## ocrd-tesserocr `dpi: 470`

Currently when there is a `@points=""` the validator crashes: ``` File "ocrd_validators/page_validator.py", line 274, in validate_consistency node_poly = make_poly(polygon_from_points(parent_points)) File "ocrd_utils/image.py", line 148, in polygon_from_points polygon.append([float(x_y[0]), float(x_y[1])]) ValueError: could not...

We currently add our `get_AllRegions()` to the `Page` level only. But the PAGE-XML / generateDS API makes it complicated to enumerate all types or regions – not only on the...

enhancement

If neither the input fileGrp nor the page ID is directly contained in the file ID of the input file, then `make_file_id` determines the index of that file in the...

bug

Implementation of the [workflow server](https://hackmd.io/23-JzLp_Q96cb6T0ttoFIA). ocrd --help ``` Commands: bashlib Work with bash library log Logging ocrd-tool Work with ocrd-tool.json JSON_FILE validate All the validation in one CLI process Run...

By implementing #635 to properly handle all cases of PAGE-XML file matching per pageId, we have lost sight of the severe performance penalty that this comes with. In effect, we...

Example: https://circleci.com/api/v1.1/project/github/OCR-D/ocrd_calamari/177/output/106/0?file=true&allocation-id=62165a9241d4334ebb050ee2-0-build%2F1CB8E496 Excerpt: ~~~ ocrd resmgr download ocrd-calamari-recognize qurator-gt4histocr-1.0 16:06:28.067 INFO ocrd.cli.resmgr - Downloading resource {'url': 'https://qurator-data.de/calamari-models/GT4HistOCR/2019-12-11T11_10+0100/model.tar.xz', 'type': 'tarball', 'name': 'qurator-gt4histocr-1.0', 'description': 'Calamari model trained with GT4HistOCR', 'size': 90275264, 'path_in_archive':...