page validator: additional checks

Open: bertsky opened this issue 4 years ago • 1 comment

Beyond actual (syntactic) schema violations ("validity") and conventional (semantic) problems ("inconsistency"), we might want to check for and repair additional issues:

  • if /PcGts/Page/ReadingOrder or any of its children is empty (in which case PageViewer fails to load) – as repaired by https://github.com/bertsky/workflow-configuration
  • if any /PcGts/Page/ReadingOrder//@regionRef does not point to an existing segment identifier (in which case PageViewer fails to load) – as repaired by https://github.com/bertsky/workflow-configuration
  • if any //TextEquiv contains neither PlainText nor Unicode (in which case PageViewer fails to load) – as repaired by https://github.com/bertsky/workflow-configuration
  • ...?
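A minimal sketch of how the three checks listed above could be detected, using only the Python standard library. This is not the validator's existing code: the function name `find_additional_issues`, the helper `local_name`, and the message strings are hypothetical. It also assumes that segment identifiers are the `@id` values of elements whose name ends in `Region`, and that a group counts as empty when it has no reference or group children.

```python
import xml.etree.ElementTree as ET

# Element names as used in the PAGE schema's ReadingOrder subtree.
GROUP_TAGS = {"OrderedGroup", "OrderedGroupIndexed",
              "UnorderedGroup", "UnorderedGroupIndexed"}
REF_TAGS = {"RegionRef", "RegionRefIndexed"}


def local_name(element):
    """Strip the namespace so the checks work for any PAGE schema version."""
    return element.tag.rsplit("}", 1)[-1]


def find_additional_issues(page_xml):
    """Return a list of human-readable issue strings (hypothetical format)."""
    root = ET.fromstring(page_xml)
    issues = []
    # Assumption: every referencable segment is a *Region element with an @id.
    region_ids = {el.get("id") for el in root.iter()
                  if local_name(el).endswith("Region") and el.get("id")}
    for reading_order in root.iter():
        if local_name(reading_order) != "ReadingOrder":
            continue
        for el in reading_order.iter():  # includes ReadingOrder itself
            name = local_name(el)
            if name == "ReadingOrder" or name in GROUP_TAGS:
                # Check 1: ReadingOrder or one of its groups has no members.
                if not any(local_name(c) in GROUP_TAGS | REF_TAGS for c in el):
                    issues.append(f"empty {name} {el.get('id', '')}".strip())
            # Check 2: @regionRef must point to an existing segment.
            ref = el.get("regionRef")
            if ref is not None and ref not in region_ids:
                issues.append(f"dangling regionRef {ref}")
    for el in root.iter():
        # Check 3: TextEquiv needs a PlainText or Unicode child.
        if local_name(el) == "TextEquiv":
            if not any(local_name(c) in {"PlainText", "Unicode"} for c in el):
                issues.append("TextEquiv without PlainText or Unicode")
    return issues
```

The sketch only reports problems; repairing them (as the linked workflow-configuration repository does) would be a separate step.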

bertsky, Nov 10 '21 14:11

Also:

  • if any /PcGts/Page/ReadingOrder/(OrderedGroup|OrderedGroupIndexed)/@index is not in order (or clashing)
  • if any //TextEquiv/@index is not in order (or clashing)
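The two index checks above could be sketched roughly as follows. The interpretation is an assumption: "not in order" is read as the `@index` values not ascending in document order, and "clashing" as the same value occurring twice among siblings. The names `index_problems` and `find_index_issues` and the tuple format are hypothetical, not part of any existing validator API.

```python
import xml.etree.ElementTree as ET


def local_name(element):
    """Strip the namespace so the checks work for any PAGE schema version."""
    return element.tag.rsplit("}", 1)[-1]


def index_problems(indices):
    """Classify a list of sibling index values (in document order)."""
    problems = []
    if len(set(indices)) != len(indices):
        problems.append("clashing")
    if indices != sorted(indices):
        problems.append("not in order")
    return problems


def find_index_issues(page_xml):
    """Return (parent name, parent id, problem) tuples (hypothetical format)."""
    root = ET.fromstring(page_xml)
    issues = []
    for parent in root.iter():
        name = local_name(parent)
        if name in ("OrderedGroup", "OrderedGroupIndexed"):
            # All indexed members of an ordered group share one index space.
            members = [c for c in parent if c.get("index") is not None]
        else:
            # TextEquiv alternatives are compared among siblings only.
            members = [c for c in parent
                       if local_name(c) == "TextEquiv"
                       and c.get("index") is not None]
        # Assumes schema-valid integer values; int() raises otherwise.
        indices = [int(c.get("index")) for c in members]
        for problem in index_problems(indices):
            issues.append((name, parent.get("id"), problem))
    return issues
```

A repair step could renumber the members in document order, but whether document order or the existing index values should win is a design decision this sketch leaves open.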

bertsky, Sep 16 '22 16:09