dinglehopper
dinglehopper copied to clipboard
An OCR evaluation tool
When printed, the HTML report has the following issues for long texts: * Page break between "Character differences" header and the differences * Long texts are cut off after one...
The stylesheet for the report linked as `` should be distributed with the sources in order to support the rendering even when the tool is used offline.
Motivated by some experiments with the corpus of [Deutsches Textarchiv](https://www.deutschestextarchiv.de/download), it would be convenient if we could read TEI.
This PR adds a Dockerfile and Makefile to create a dockerimage for this processor. Ideally all OCR-D processors offer the same way to create an image for them, which is...