GH-598: First QA fix

Open bertllll opened this issue 2 months ago • 0 comments

bertllll avatar Oct 03 '25 13:10 bertllll

Oh, and since the question has been raised somewhere: I stumbled again upon this:

https://github.com/OCR-D/core/blob/255ddbd521b4d95346e71575775565c627432455/Makefile#L234-L238

So, apparently, the namespace prefix is not merely the PAGE schema's choice; we actually had to enforce it.

I am not sure whether everything will still work out of the box if we drop this, though. (IMHO it should stay, anyway.)

bertsky avatar Oct 29 '25 17:10 bertsky

I think we introduced this not because we explicitly wanted a pc: prefix, but because generateDS generated XML that used the pc: prefix for the elements without declaring it, i.e. that was a bug in generateDS.
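
To illustrate the difference between using a prefix and declaring it, here is a minimal sketch with Python's standard library. It is not taken from OCR-D core or generateDS; the fragments are stripped-down and not schema-valid PAGE, and the namespace URI is assumed to be the 2019 one.

```python
import xml.etree.ElementTree as ET

# Assumption: the PAGE 2019 namespace URI, used here only for illustration.
PAGE_NS = "http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15"


def parses(xml_text):
    """Return True if the parser accepts xml_text, False on a parse error."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# The pc: prefix is used but never bound to a namespace.
undeclared = "<pc:PcGts><pc:Page/></pc:PcGts>"

# Same elements, with the prefix bound via xmlns:pc on the root.
declared = f'<pc:PcGts xmlns:pc="{PAGE_NS}"><pc:Page/></pc:PcGts>'

# Registering the prefix makes ElementTree serialize with pc:
# (and emit the matching xmlns:pc declaration) instead of ns0:.
ET.register_namespace("pc", PAGE_NS)
root = ET.Element(f"{{{PAGE_NS}}}PcGts")
ET.SubElement(root, f"{{{PAGE_NS}}}Page")
serialized = ET.tostring(root, encoding="unicode")

print(parses(undeclared), parses(declared))
print(serialized)
```

A namespace-aware parser rejects the first fragment because the prefix is unbound, which is why output with an undeclared pc: needs fixing one way or another.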

It's also sensible IMHO because in the docs, we always use pc: for PAGE and mets: for METS. I also just personally prefer it that way, but that's not a strong argument ;)

In any case, thanks for the update, I'll test it and let's discuss the ramifications. I do like the features, but I fear the repercussions of keeping PAGE-XML in different versions interoperable...

kba avatar Oct 30 '25 10:10 kba

Just for reference: the version of page.xsd I used here is exactly the one from current PRImA master – just without carriage return EOL (because that would have made for a very big diff).
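
For anyone wanting to reproduce that kind of EOL normalization before diffing, a minimal sketch follows. This is only an assumption about how it could be done, not necessarily how the file here was produced; `to_lf` is a hypothetical helper name.

```python
def to_lf(data: bytes) -> bytes:
    """Convert CRLF line endings to LF, leaving all other bytes untouched."""
    return data.replace(b"\r\n", b"\n")


# Small in-memory stand-in for a schema file with CRLF line endings.
sample = b"<schema>\r\n  <element/>\r\n</schema>\r\n"
normalized = to_lf(sample)
print(normalized)
```

Working on bytes rather than text avoids any re-encoding of the file, so the diff only shows real content changes.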

As you have pointed out on other channels, it seems PRImA has already decided to release a 2024 namespace, just not made it public on their GitHub. In that changelog they unfortunately attribute

  • MapRegion can now be used recursively (nested in other regions)

to the 2024 version, not the 2019 version, which indicates they are not willing to let this be covered by a mere bugfix post-release (as with the other patch release for 2019). IMHO this is not a good decision (if it is deliberate at all).

bertsky avatar Oct 30 '25 11:10 bertsky