spec issues

3

Currently, we only specify how to describe the hierarchy of pages (represented by a set of files under `mets:structMap/mets:div/mets:div`) and their order. But nothing so far on logical structure **across...

bertsky

new mets:fileGrp

It would be very useful to introduce a new mets:fileGrp especially for ground truth datasets. This new group includes both Region and Line level segmentations. Suggested name: ```xml ```

tboenig

Specify the "mets:file[@ID] should match pc:Page[@pcGtsId]" rule

While checking https://github.com/OCR-D/core/pull/1066, I noticed that we have the rule in the validator but AFAICT not in the specs that the `@pcGtsId` of a PAGE document should be the same...

kba

Specs for Continous Integration and Code Reviews

2

We briefly talked about those in the Tech Call today and decided to make these part of the spec, hence this PR. I took the liberty of updating the list...

kba

ocrd_eval: add alternative order metrics

1

instead of https://github.com/OCR-D/ocrd-website/pull/354

bertsky

QA Specs: How to deal with consecutive white spaces

3

The specification currently makes no suggestion on how to deal with more than one consecutive white space character.

mweidling

Skipping OCR processing based on logical `mets:structMap`

8

From my and @bertsky's discussion at https://github.com/qurator-spk/eynollah/issues/67: >> Yes, it should be possible to skip pages marked as certain types in the logical structmap – not just in any one...

mikegerber

Web API: change /processor/{executable}/{job_id} to just /processor/{job_id}

According to this [discussion on the Processing Server implementation](https://github.com/OCR-D/core/pull/974#discussion_r1138901846), we should simplify here. (But it must be clear at all times what is a workflow job ID and what is...

bertsky

Dockerfile: improve and update to GHCR

bertsky

spec
spec copied to clipboard

Metadata

Sort import statements with isort

allow global fptrs in structMap

new mets:fileGrp

Specify the "mets:file[@ID] should match pc:Page[@pcGtsId]" rule

Specs for Continous Integration and Code Reviews

ocrd_eval: add alternative order metrics

QA Specs: How to deal with consecutive white spaces

Skipping OCR processing based on logical `mets:structMap`

Web API: change /processor/{executable}/{job_id} to just /processor/{job_id}

Dockerfile: improve and update to GHCR

← Metadata

Owner

Metadata

spec spec copied to clipboard

Metadata

← Metadata

Owner

Metadata

spec
spec copied to clipboard