Philipp Zumstein

87 comments by Philipp Zumstein

I guess that it still validates without the SP tags. Moreover, most of the information (HPOS, WIDTH) can be calculated from the line above and below, but if the width...

> but the seems to imply it. Not sure. I only see here, that, if `` occurs, then it has to occur after a ``.

The addition of the `` should be handled upstream in the corresponding transformation. Currently, we use hocr2alto and page2alto. We can keep this issue here open as a reminder.

Also http://able.myspecies.info/abbyy-xml-tei-xml (looks a little special at first glance...)

We don't have any use case for this at the moment. Maybe we can just leave the issue open here and collect more information and any possible implementations by reusing...

Thank you @kba, that looks interesting as well! Let me know when anyone wants to work on integrating any of these transformations in `ocr-fileformat`.

Nice! @jmechnich Can you create a PR? Then it is easier to discuss this further. But I am quite happy with such an XSLT transformation, even when there are no...

Do you have any test data as JSON from Computer Vision OCR you can share here?

I am not sure that we need git submodules, which a lot of people also struggle to use. Moreover, this would only be a partial update mechanism because we...

Can you share a complete ALTO file as an example illustrating the problem?