Mike Gerber

Results 79 comments of Mike Gerber

> > while [Mewel/abbyy-to-alto](https://github.com/Mewel/abbyy-to-alto) worked right away. > Sort of - it does not produce `Processing` tags (or the ALTO v2 equivalent), so it is lacking too. I've added that...

Is langdata obsolete as langdata_lstm exists?

As far as I can understand this, sbb_page_extractor is one processing step of eynollah, maybe it could just be an eynollah option to only do this aspect of the overall...

While I think "Position (order number) of this item within the current hierarchy level. " (from PAGE-XML's [schema](https://www.primaresearch.org/schema/PAGE/gts/pagecontent/2019-07-15/pagecontent.xsd)) could be clearer, I, too, think the implementation in both PAGE Viewer...

(Is there any good reason to order XML elements by a special index attribute instead of just using the XML order? For stream rewriting maybe?)

@vahidrezanezhad Any news on this bug?

While eynollah should handle this gracefully, we should also consider how to handle irrelevant images that are already marked as such in the METS `structMap`. In this case possibly `spine`...

> Yes, it should be possible to skip pages marked as certain types in the logical structmap – not just in any one processor, but as a general mechanism for...

I'd like to note that when creating a Python virtualenv, you can specify the Python interpreter to use, i.e.: ~~~ virtualenv -p /usr/bin/python3.7 /tmp/venv-example ~~~ or, using mkvirtualenv (if you...

> > could something like `pip > 19` be added to `requirements.txt`? > > It could but then you still have to run it twice because the first time round...