pyocr icon indicating copy to clipboard operation
pyocr copied to clipboard

Test environment to make tests reproducable

Open jflesch opened this issue 8 years ago • 1 comments

Idea: provide a pre-compiled environment (container / chroot ?) containing Tesseract and all its dependencies ---> could be used to make the test results reproducibles perfectly.

jflesch avatar Nov 14 '17 14:11 jflesch

I had a quick look at the unit-tests. Looks like a lot of them are dependent on the actual OCR process itself. Seeing as the tesseract-ocr project changes the trained data occasionally, changes their actual algorithms, and one is not sure which version of the trained data each person has installed, you're pretty-much guaranteed to get slight mismatches in the OCR results every time you run the tests on a new machine.

ZoranPavlovic avatar Jan 02 '18 20:01 ZoranPavlovic