Stirling-PDF

Suggestion to prevent volume/path breaking changes

Open Royal2000H opened this issue 2 years ago • 1 comments

From the 0.16.0 release notes:

OCR update to tesseract 5 from 4.00 (Requires volume path change) and OCRmyPDF update

I assume it refers to this:

- /location/of/trainingData:/usr/share/tesseract-ocr/4.00/tessdata

Changing into:

- /location/of/trainingData:/usr/share/tesseract-ocr/5/tessdata

My suggestion would be not requiring the version number in the docker config in order to prevent continuous breaking changes. Possibly you could set up a folder like /trainingData that hard links or something to where you need it (/usr/share/tesseract-ocr/5/tessdata in this case). This would require yet another breaking change but would prevent future ones. Also if you do it soon, then likely people only need to fix their configurations once.

So then people just mount - /location/of/trainingData:/trainingData

I don't know how to do this or I'd submit a PR.

Royal2000H avatar Dec 11 '23 01:12 Royal2000H
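The static mount point suggested above could be wired up with a symlink created when the container starts. The following is only a minimal sketch under stated assumptions, not how Stirling-PDF implements it: the function name `link_tessdata` is hypothetical, and the choice to move the image's bundled data aside to a `.bundled` directory is an assumption made for illustration.

```shell
#!/bin/sh
# Hypothetical helper (not part of Stirling-PDF): point the versioned
# tessdata path at a version-independent directory that users mount.
link_tessdata() {
    stable_dir="$1"      # path users mount, e.g. /trainingData
    versioned_dir="$2"   # path Tesseract reads, e.g. /usr/share/tesseract-ocr/5/tessdata

    # Nothing to do if no training-data volume was mounted.
    [ -d "$stable_dir" ] || return 0

    if [ -L "$versioned_dir" ]; then
        # A link from a previous start: drop it so it can be recreated.
        rm "$versioned_dir"
    elif [ -e "$versioned_dir" ]; then
        # Assumption: keep the data shipped in the image by moving it aside.
        mv "$versioned_dir" "$versioned_dir.bundled"
    fi

    mkdir -p "$(dirname "$versioned_dir")"
    ln -s "$stable_dir" "$versioned_dir"
}
```

An entrypoint script could then call `link_tessdata /trainingData /usr/share/tesseract-ocr/5/tessdata`, so only that one line changes when the Tesseract version changes, while user configurations keep mounting `/trainingData`.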

For now I added a script which copies /usr/share/tesseract-ocr/4.00/tessdata to /usr/share/tesseract-ocr/5/tessdata

So there shouldn't actually be a breaking change, but I wanted to mention it to get people to copy things over anyway
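A compatibility copy of this kind might look roughly like the sketch below. This is an illustration only, not the actual script shipped in the image: the function name `copy_legacy_tessdata` is hypothetical, and the behavior when the old directory is absent (silently doing nothing) is an assumption.

```shell
#!/bin/sh
# Hypothetical sketch: copy training data from an old versioned
# tessdata directory into the new one at container start.
copy_legacy_tessdata() {
    old_dir="$1"   # e.g. /usr/share/tesseract-ocr/4.00/tessdata
    new_dir="$2"   # e.g. /usr/share/tesseract-ocr/5/tessdata

    # Assumption: if nothing is mounted at the old path, do nothing.
    [ -d "$old_dir" ] || return 0

    mkdir -p "$new_dir"
    # "old_dir/." copies the directory's contents rather than the directory itself.
    cp -R "$old_dir/." "$new_dir/"
}
```

Calling `copy_legacy_tessdata /usr/share/tesseract-ocr/4.00/tessdata /usr/share/tesseract-ocr/5/tessdata` would let a configuration that still mounts the 4.00 path keep working.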

But I see your point, there should be a static location. I will consider this!

Frooodle avatar Dec 11 '23 12:12 Frooodle

Tesseract did this in their latest version a long time ago now, moving everything to just /usr/share/tesseract

Frooodle avatar Jul 31 '24 19:07 Frooodle