hocr
hocr copied to clipboard
rewrite tidy_tesseract
without dplyr dependency..., make internal and call from hocr_parse