tessdata_best
tessdata_best copied to clipboard
Ara language is showing "empty page!!" on one laptop and gives the answer on another
Downloaded ara_best language to test on tesseract and it showed "empty page !!"
tesseract version: tesseract 5.0.1 leptonica-1.79.0 libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 2.0.3) : libpng 1.6.37 : libtiff 4.1.0 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.1 Found AVX2 Found AVX Found FMA Found SSE4.1 Found OpenMP 201511 Found libarchive 3.4.0 zlib/1.2.11 liblzma/5.2.4 bz2lib/1.0.8 liblz4/1.9.2 libzstd/1.4.4 Found libcurl/7.68.0 OpenSSL/1.1.1f zlib/1.2.11 brotli/1.0.7 libidn2/2.2.0 libpsl/0.21.0 (+libidn2/2.2.0) libssh/0.9.3/openssl/zlib nghttp2/1.40.0 librtmp/2.3
OS: ubuntu 20.04 ( both laptops have the same os and same tesseract version ) when i try to list available langauges, ara language appears in them without any problem
is there any explanatory reason for this ?
Thanks in advance!
You did not show us the image which gives those different results. Can you add it to this issue report?
With identical hardware and software the OCR results are normally identical, too. Different hardware and software can result in (typicall very small) differences.
@stweil Thank you for the quick response ! I realized a slight difference that might cause the problem also, i tried to run one of them from cmd and the other from pytesseract from cmd i got Empty Page !! and from pytesseract i got the correct answer
i tried also to specify --psm mode in cmd until it worked to show some output but it wasnt accurate, where as in pytesseract the result was way better.
Thanks in advance
Please share test.jpg and the expected correct OCR result so that we can test.