ochre icon indicating copy to clipboard operation
ochre copied to clipboard

Additional OCR Post correction datasets

Open jvdzwaan opened this issue 6 years ago • 2 comments

Can be added to the list of datasets.

  • MiBio
    • https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6197712/
    • https://github.com/jie-mei/MiBio-OCR-dataset

jvdzwaan avatar Nov 19 '18 16:11 jvdzwaan

  • RETAS
    • Text alignment software and evaluation dataset
    • email to obtain
    • http://ciir.cs.umass.edu/downloads/ocr-evaluation/

jvdzwaan avatar Nov 19 '18 16:11 jvdzwaan

OCR text, but no gold standard: https://github.com/marriott-library/collections-as-data

jvdzwaan avatar Feb 03 '19 08:02 jvdzwaan