Robert Sachunsky

Results 721 comments of Robert Sachunsky
trafficstars

In 6263bb1 I have started implementing the [processing server](https://hackmd.io/23-JzLp_Q96cb6T0ttoFIA) pattern. It is completely indepent of the workflow server for now, so the latter still needs to be adapted to make...

@tdoan2010 your understanding of the workflow server is correct. This PR _also_ implements a **processing server**, but you did not address that part. To your interpretation: > ### Disadvantage >...

> * https://github.com/vahidrezanezhad/page-xml-to-image cf. https://github.com/vahidrezanezhad/page-xml-to-image/issues/1 > It might be useful to have a processor in OCR-D that renders PAGE-XML and its elements to an image file to debug. There are...

Ok, then I suggest simply moving all `ocrd-segment-extract-*` stuff from ocrd_segment into `ocrd.processor.builtin` and refactoring the various shared functions into `ocrd_models.ocrd_page`, maybe grouped into coordinate repair (unravelling validation reports), iteration...

> Now that the foundations are here, we can discuss what kind of information should be captured and how messages should be formatted. We discussed this briefly in yesterday's tech...

> the first one is already in place (`ocrd resmgr discover`). That's not quite what I meant though. I would expect the new `OcrdResourceManager.discover` to be called by `OcrdResourceManager.__init__` (right...

> Doing this every time the `OcrdResourceManager` is instantiated is a huge performance penalty. E.g. I currently have about 70 processors installed. A lot of them don't separate `__init__` and...

I took the liberty of checking tasks that have been solved by now above. > * code to mix the `Processor.ocrd_tool['resources']` with the preinstalled centralized database in `OcrdResourceManager`'s constructor and...

**Update:** > Currently, the workspace validator will complain about any kind of file (including derived images) that is not contained in the structMap as physical page. That's _still_ the case...

Example of this bug: ``` File 'FULLDOWNLOAD' does not manifest any physical page. ``` Can we at least get a skip flag for this, @kba?