core issues

Allow import and upgrade of previous PAGE versions

Right now, we're consciously only supporting the newest version of the PAGE Content schema and will upgrade to the newest version in Mid-July. However, we should support at least reading&upgrading...

kba

[RFC] Drop workspace backup mechanism?

4

The mechanism for backing up workspace data has next to no documentation and hasn't been in development for well over a year. Is anybody still using it? As we'll [add...

kba

OcrdExif / page_from_image: show/use orientation

1

I don't know how realistic this is for OCR-D data providers: What if the original images are not oriented upright / positive, and the imaging system already knows about this...

bertsky

enhancement

workspace cloning: filter by fileGrps and/or pages

Now that we have support for pageId filtering and regexes in `find` / `remove` / `remove-group`, how about support for **partial cloning** with a filter (either comma-separated or regex) on...

bertsky

enhancement

generateDS API: find any element by ID

Like `DOM.getElementById` but for the PAGE XML API.

kba

workspace download: also traverse dependent file groups?

3

When I want to download a PAGE-XML from remote, it would be very helpful if core would also download all the files referenced in `/PcGts/Page/@imageFilename` and `*/AlternativeImage/@filename`. Is this feasible?

bertsky

enhancement

ocrd_utils.coordinates_for_segment: clip to parent?

5

We have the utility function `coordinates_for_segment` for conversion of polygons from some derived image and its coordinate transform to absolute coordinates of the original image. This is all the function...

bertsky

enhancement

question

ARM support

It would be interesting to see how well OCR-D can be used on computers with ARM architectures, like in Raspberry Pi or Odroid. Travis has [support for building for different...

kba

enhancement

RFC: Make workspace cloning more robust

6

Currently cloning of a workspace with `ocrd workspace clone --download` aborts if some files cannot be downloaded. It would help if instead of aborting the download all other files would...

stweil

enhancement

smoke test bashlib-based processor

@bertsky in #410 > Maybe we should have some pseudo-processor test in ocrd/bashlib/test (beside the arg-parsing test). We should.

kba

core
core copied to clipboard

Metadata

Allow import and upgrade of previous PAGE versions

[RFC] Drop workspace backup mechanism?

OcrdExif / page_from_image: show/use orientation

workspace cloning: filter by fileGrps and/or pages

generateDS API: find any element by ID

workspace download: also traverse dependent file groups?

ocrd_utils.coordinates_for_segment: clip to parent?

ARM support

RFC: Make workspace cloning more robust

smoke test bashlib-based processor

← Metadata

Owner

Metadata

core core copied to clipboard

Metadata

← Metadata

Owner

Metadata

core
core copied to clipboard