ocrmypdf-auto

Feature Request: Option to move non-PDF files to the Output folder

Open MrReSc opened this issue 6 years ago • 2 comments

Hello, I use your container for my paperless office, and it works great. The scanner stores each PDF in a folder (Input) on my server that the users don't have access to. ocrmypdf-auto then does its work and saves the finished PDF on a shared drive. From time to time, images (JPG) are scanned as well. ocrmypdf-auto does not process these scans because the file extension is not *.pdf, which is the correct behavior. Would it be possible to add an option that simply moves files that do not end with *.pdf to the Output folder?

MrReSc avatar Jan 05 '19 05:01 MrReSc

Hi @MrReSc - I like the idea. I will take a look at the code and see how much effort it would take to start acting on non-PDF files.

I can imagine a few different actions being interesting:

  1. As you suggest, just move the file to the output folder.
  2. Convert the file to a PDF and attempt the same OCR over it.
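
For illustration, the first action could look roughly like the sketch below. This is not code from ocrmypdf-auto: the function name `move_non_pdfs` and its `input_dir` / `output_dir` parameters are hypothetical, and it assumes the relative folder layout under the input folder should be preserved in the output folder. The second action (convert to PDF, then OCR) is not shown.

```python
import shutil
from pathlib import Path
from typing import List


def move_non_pdfs(input_dir: Path, output_dir: Path) -> List[Path]:
    """Move every file under input_dir that is not a .pdf into output_dir.

    Hypothetical helper, not part of ocrmypdf-auto. Subfolders are
    recreated under output_dir. Returns the destination paths.
    """
    moved = []
    # Build the full file list first so moving files does not disturb the walk.
    candidates = sorted(p for p in input_dir.rglob("*") if p.is_file())
    for src in candidates:
        if src.suffix.lower() == ".pdf":
            continue  # PDFs are left in place for the normal OCR run
        dest = output_dir / src.relative_to(input_dir)
        dest.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(dest))
        moved.append(dest)
    return moved
```

A real implementation would probably also need to wait until the scanner has finished writing a file before moving it, and decide what to do when a file with the same name already exists in the output folder.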

cmccambridge avatar Jan 24 '19 15:01 cmccambridge

Thank you for looking at my proposal. If there is anything I can test, I would of course be happy to help.

MrReSc avatar Feb 18 '19 14:02 MrReSc