tesseract icon indicating copy to clipboard operation
tesseract copied to clipboard

Bindings to Tesseract OCR engine for R

Results 20 tesseract issues
Sort by recently updated
recently updated
newest added

I have a PDF file which has been scanned on. From my understanding this needs to go through OCR before it can become a CSV file. I am new to...

Hello there, Thanks for this amazing binding! I am running into some performance issues and I wonder if you have some hints or ideas. Basically, the R wrapper works fine...

It would be great if this package supported adding back the retrieved text from a raster to PDF format. For example, using `tesseract` directly from the command line makes this...

Users installing on Linux machines may see: ``` Error opening data file /usr/share/tesseract-ocr/tessdata/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory....

Hi all, I'm working in recognize some pdf image that haven't an excellent quality and I want to perform a better OCR using the third dictionary, called user-words, but I...

I have to pull data from a pdf uploaded at a URL. The pdf is in an image/.png format hence while using the tesseract package few of the lines were...

``` $ install.packages("tesseract") “package ‘tesseract’ is not available (for R version 3.6.0)” ```

Hi all, I tried this package to extract text from a simple picture, however, the results are not as good as expected, here is my pic with 300dpi: ![path_ziji_300](https://user-images.githubusercontent.com/19953005/62859279-40377680-bd2f-11e9-8f82-310ddfb2f746.jpg) ```{r}...

Hi, I'm experiencing an issue using page segmentation mode on 1 (auto+osd), where the following call results in an error message: `engine Tesseract couldn't load any languages! > Warning: Auto...

This is related to #8 and #39 (or more accurately, the underlying ideas within them). With the upstream issue that the whitelist and blacklist are not implemented in tesseract 4...