page2tei
export of TableCell suppressed
My PAGE XML output from Transkribus looks like this:
<TableRegion id="Table_1572948924800_4" custom="readingOrder {index:0;}">
<Coords points="85,74 3981,55 4041,4922 93,4939"/>
<TableCell row="0" col="0" rowSpan="1" colSpan="1" leftBorderVisible="true" rightBorderVisible="true" topBorderVisible="true" bottomBorderVisible="true" orientation="0.0" id="TableCell_1572948967045_27">
<Coords points="86,91 83,494 484,483 485,81"/>
<TextLine id="TableCell_1572948967045_27l1" custom="readingOrder {index:0;}">
<Coords points="93,212 161,219 186,221 212,222 237,223 263,222 288,220 314,218 339,216 365,213 379,229 443,257 443,172 390,126 365,128 339,131 314,133 288,135 263,137 237,138 212,137 186,136 164,95 96,88"/>
When this is transformed with page2tei-0.xsl, the information about the TableCell is lost, i.e. no corresponding <tei:zone> element is created.
To me, this looks like a bug, and I'm happy to send in a PR. However, several lines, e.g. https://github.com/dariok/page2tei/blob/b7f5923eb944874312fd5159f65142b35f58cdc8/page2tei-0.xsl#L199 and https://github.com/dariok/page2tei/blob/b7f5923eb944874312fd5159f65142b35f58cdc8/page2tei-0.xsl#L246, seem to indicate that this was done on purpose. Is that the case?
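To make the expected result concrete, here is a rough Python sketch (not page2tei code, and not a proposal for how the stylesheet should do it) that builds one <tei:zone> per TableCell from a shortened copy of the excerpt above. The function name cells_to_zones, the type="tableCell" value and the use of @n for row and column are assumptions made for illustration only. The sample omits the PAGE namespace and the TextLine, and adds closing tags so that it parses.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"

# Shortened sample based on the excerpt above: namespace and TextLine
# omitted, closing tags added so that the fragment is well-formed.
SAMPLE = """<TableRegion id="Table_1572948924800_4">
  <Coords points="85,74 3981,55 4041,4922 93,4939"/>
  <TableCell row="0" col="0" rowSpan="1" colSpan="1" id="TableCell_1572948967045_27">
    <Coords points="86,91 83,494 484,483 485,81"/>
  </TableCell>
</TableRegion>"""


def cells_to_zones(table_region):
    """Return one TEI zone element per TableCell child (hypothetical helper)."""
    zones = []
    for cell in table_region.findall("TableCell"):
        zone = ET.Element(f"{{{TEI_NS}}}zone")
        # Assumption: the attribute values below are illustrative choices,
        # not what page2tei emits for other regions.
        zone.set("type", "tableCell")
        zone.set("n", f'{cell.get("row")},{cell.get("col")}')
        # Carry the cell outline over unchanged.
        zone.set("points", cell.find("Coords").get("points"))
        zones.append(zone)
    return zones


zones = cells_to_zones(ET.fromstring(SAMPLE))
for zone in zones:
    print(zone.get("n"), zone.get("points"))
```

In the stylesheet itself this would presumably be a template matching TableCell; the sketch only shows which information I would expect to survive the transformation.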