
Collection of OCR-related Python tools and wrappers from @OCR-D

Results 215 core issues

Right now, we're consciously supporting only the newest version of the PAGE Content schema and will upgrade to the newest version in mid-July. However, we should support at least reading & upgrading...

The mechanism for backing up workspace data has next to no documentation and hasn't been in development for well over a year. Is anybody still using it? As we'll [add...

I don't know how realistic this is for OCR-D data providers: What if the original images are not oriented upright / positive, and the imaging system already knows about this...

enhancement

Now that we have support for pageId filtering and regexes in `find` / `remove` / `remove-group`, how about support for **partial cloning** with a filter (either comma-separated or regex) on...
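To make the filter idea concrete, here is a minimal sketch of how a page ID filter could accept either a comma-separated list or a regex. It is not OCR-D code: the function name `select_page_ids` and the `//` prefix used to mark a regex are assumptions made for illustration only.

```python
import re


def select_page_ids(all_ids, spec):
    """Return the IDs from all_ids selected by spec, in their original order.

    Assumption for this sketch: a spec starting with '//' is a regex that
    must match the whole ID; anything else is a comma-separated list.
    """
    if spec.startswith("//"):
        pattern = re.compile(spec[2:])
        return [page_id for page_id in all_ids if pattern.fullmatch(page_id)]
    wanted = {part.strip() for part in spec.split(",") if part.strip()}
    return [page_id for page_id in all_ids if page_id in wanted]


pages = ["PHYS_0001", "PHYS_0002", "PHYS_0010"]
by_list = select_page_ids(pages, "PHYS_0001,PHYS_0010")
by_regex = select_page_ids(pages, r"//PHYS_000\d")
```

A partial clone could then copy only those files whose page ID is in the returned list.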

enhancement

Like `DOM.getElementById` but for the PAGE XML API.
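As a rough illustration of what such a lookup might do, the sketch below walks a PAGE document with the standard library's `xml.etree.ElementTree` and returns the first element with a given `id`. The helper name `get_element_by_id` and the sample document are hypothetical; this is not the PAGE XML API of core.

```python
import xml.etree.ElementTree as ET

# Tiny hand-written PAGE document, used only for this example.
PAGE_SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="0001.tif" imageWidth="100" imageHeight="100">
    <TextRegion id="r1">
      <TextLine id="r1_l1"/>
    </TextRegion>
  </Page>
</PcGts>"""


def get_element_by_id(root, element_id):
    """Return the first element whose 'id' attribute equals element_id, else None."""
    for element in root.iter():
        if element.get("id") == element_id:
            return element
    return None


root = ET.fromstring(PAGE_SAMPLE)
line = get_element_by_id(root, "r1_l1")
missing = get_element_by_id(root, "does-not-exist")
```

A linear scan is fine for a sketch; an implementation that is called often would probably build an ID index once per document.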

When I want to download a PAGE-XML file from a remote location, it would be very helpful if core would also download all the files referenced in `/PcGts/Page/@imageFilename` and `*/AlternativeImage/@filename`. Is this feasible?
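The first step of such a feature would be to collect the referenced filenames. The sketch below does that with the standard library only; the helper `referenced_images` and the sample document are hypothetical, and the actual download step is left out.

```python
import xml.etree.ElementTree as ET

# Hand-written sample; the file paths are made up for this example.
PAGE_SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="OCR-D-IMG/0001.tif" imageWidth="100" imageHeight="100">
    <AlternativeImage filename="OCR-D-IMG-BIN/0001.png"/>
    <TextRegion id="r1">
      <AlternativeImage filename="OCR-D-IMG-CROP/r1.png"/>
    </TextRegion>
  </Page>
</PcGts>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts in front of tag names."""
    return tag.rsplit("}", 1)[-1]


def referenced_images(root):
    """List Page/@imageFilename and every AlternativeImage/@filename in document order."""
    references = []
    for element in root.iter():
        name = local_name(element.tag)
        if name == "Page" and element.get("imageFilename"):
            references.append(element.get("imageFilename"))
        elif name == "AlternativeImage" and element.get("filename"):
            references.append(element.get("filename"))
    return references


images = referenced_images(ET.fromstring(PAGE_SAMPLE))
```

Matching on the local tag name keeps the sketch independent of the PAGE schema version in the namespace URI.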

enhancement

We have the utility function `coordinates_for_segment` for conversion of polygons from some derived image and its coordinate transform to absolute coordinates of the original image. This is all the function...
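For readers unfamiliar with the idea, the sketch below shows the general technique of mapping a polygon from a derived image back to the original image by inverting a 2D affine transform. It does not reproduce the signature or behaviour of `coordinates_for_segment`; the function `to_absolute` and the 2x3 matrix layout are assumptions for illustration.

```python
def to_absolute(polygon, transform):
    """Map points from derived-image coordinates back to original-image coordinates.

    transform is assumed to be a 2x3 affine matrix ((a, b, tx), (c, d, ty))
    that maps original coordinates to derived coordinates:
        x' = a*x + b*y + tx
        y' = c*x + d*y + ty
    """
    (a, b, tx), (c, d, ty) = transform
    det = a * d - b * c
    if det == 0:
        raise ValueError("transform is not invertible")
    absolute = []
    for x_derived, y_derived in polygon:
        dx, dy = x_derived - tx, y_derived - ty
        absolute.append(((d * dx - b * dy) / det, (-c * dx + a * dy) / det))
    return absolute


# Derived image: the original cropped at offset (100, 50), then scaled by 0.5.
crop_and_scale = ((0.5, 0.0, -50.0), (0.0, 0.5, -25.0))
result = to_absolute([(0, 0), (10, 0), (10, 20)], crop_and_scale)
```

In this example the derived-image origin maps back to the crop offset (100, 50) in the original image.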

enhancement
question

It would be interesting to see how well OCR-D can be used on computers with ARM architectures, like in Raspberry Pi or Odroid. Travis has [support for building for different...

enhancement

Currently, cloning of a workspace with `ocrd workspace clone --download` aborts if some files cannot be downloaded. It would help if, instead of aborting the download, all other files would...
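One way to get that behaviour is to catch the error per file and report the failures at the end. The following is only a sketch of that pattern, not of how `ocrd workspace clone` is implemented; `download_all` and the injected `fetch` callable are hypothetical names.

```python
def download_all(urls, fetch):
    """Try every URL; collect failures instead of aborting on the first one.

    fetch is any callable that returns the content for a URL and raises
    OSError (for example urllib.error.URLError) when the download fails.
    """
    downloaded, failed = {}, {}
    for url in urls:
        try:
            downloaded[url] = fetch(url)
        except OSError as error:
            failed[url] = str(error)
    return downloaded, failed


def fake_fetch(url):
    """Stand-in for a real downloader so the example runs without network access."""
    if "missing" in url:
        raise OSError("404 Not Found")
    return b"image data"


ok, errors = download_all(
    ["http://example.org/0001.tif", "http://example.org/missing.tif", "http://example.org/0003.tif"],
    fake_fetch,
)
```

The caller can then decide whether a non-empty `errors` mapping should produce a warning or a non-zero exit status.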

enhancement

@bertsky in #410:
> Maybe we should have some pseudo-processor test in ocrd/bashlib/test (beside the arg-parsing test).

We should.