tesseract icon indicating copy to clipboard operation
tesseract copied to clipboard

Bindings to Tesseract OCR engine for R

Results 20 tesseract issues
Sort by recently updated
recently updated
newest added

Hi, When i loop over my PDFs and use OCR_Data, after a while (about 2 hours) it produces the following error: TIFFReadEncodedStrip Error --------------------------- Read error at scanline 0; got...

Thanks for a great package @jeroen! I have two suggestions regarding the vignette. # Vignette access To read the vignette one either has to go read the CRAN version or...

Hi, I was wondering if anyone can help me figure out how to use the user_patterns_suffix option when setting up the engine? I'd like to parse PDFs for a specific...

See https://github.com/tesseract-ocr/tesseract/wiki/APIExample

https://tesseract-ocr.github.io/tessdoc/Data-Files seems to indicate that tessdata_best may be better than tessdata (but doesn't support legacy): Currently (with version 5), it seems as though it only uses tessdata_fast, correct? https://github.com/ropensci/tesseract/blob/913157029d05fccbe8ab76f7b9de45ca4cca66a0/R/tessdata.R#L47C14-L47C27

this PR here does not introduce breakage, but it works under the assumption that we have tesseract >= 5 the real change here is src/ + DESCRIPTION the rest is...

this came up today https://www.theserverside.com/feature/Why-GitHub-renamed-its-master-branch-to-main "main" is a prefer term for a long list of considerations

render all images ofline + update examples + compare with aws ocr this has no changes to the functions, except the parts to use the "best" models