tesserocr icon indicating copy to clipboard operation
tesserocr copied to clipboard

Wheels on Pypi

Open ricardomga opened this issue 6 years ago • 21 comments

Please consider uploading the wheels to PyPI! It would be realy helpful.

ricardomga avatar Jun 29 '18 09:06 ricardomga

@simonflueckiger could you please help and provide wheels for linux too? Thanks.

ricardomga avatar Jul 06 '18 16:07 ricardomga

@ricardomga just letting you know that I'm currently looking into this

simonflueckiger avatar Jul 09 '18 08:07 simonflueckiger

@simonflueckiger thank you a lot!

ricardomga avatar Jul 09 '18 09:07 ricardomga

@simonflueckiger I really need this, if there is something that I can do to speed up the process let me know.

ricardomga avatar Jul 09 '18 17:07 ricardomga

@ricardomga let me know what setup you have (OS / 32bit or 64bit / python version) and I can give you a wheel for testing :)

simonflueckiger avatar Jul 09 '18 17:07 simonflueckiger

@simonflueckiger 64bit, gnu/linux (I think the distro is not important), python 3.6

ricardomga avatar Jul 09 '18 18:07 ricardomga

@ricardomga thanks, forgot to ask: are you looking for tesseract 4 or 3.05.01/02?

simonflueckiger avatar Jul 09 '18 18:07 simonflueckiger

@simonflueckiger 4 please

ricardomga avatar Jul 09 '18 18:07 ricardomga

@ricardomga can you do some testing with this:

tesserocr-2.3.0-cp36-cp36m-manylinux1_x86_64.zip

it bundles tesseract 4.0.0-beta.2, leptonica-1.76.0, libgif 5.1.4, libjpeg 9b, libpng 1.6.34, libtiff 4.0.9, zlib 1.2.11

simonflueckiger avatar Jul 09 '18 20:07 simonflueckiger

I uploaded all the binaries for linux 32bit/64bit and Python 2.7/3.4/3.5/3.6/3.7 in case anyone else would like to test them:

https://github.com/simonflueckiger/tesserocr-windows_build/releases/tag/tesserocr-v2.3.0-tesseract-4.0.0-beta.2

simonflueckiger avatar Jul 09 '18 22:07 simonflueckiger

@simonflueckiger awsome, thank you a lot!

@sirfz It would be realy nice if you upload this to Pypi, the installation would as simple as pip install tesserocr. This would really easy the deploy process in the cloud providers and to through docker containers, please consider this. @simonflueckiger what do you think about uploading this to Pypi?

ricardomga avatar Jul 10 '18 10:07 ricardomga

@ricardomga did the wheels work for you?

@sirfz @ricardomga I think uploading them to PyPI is a splendid idea! Since all the binary dependencies were built on the manylinux docker images and are packaged with the wheels, they should be working out of the box on pretty much any distro released after 2007.

simonflueckiger avatar Jul 10 '18 13:07 simonflueckiger

@simonflueckiger I am getting a segmentation fault in docker image python:3.6.5 Any sugestion?

ricardomga avatar Jul 10 '18 17:07 ricardomga

@ricardomga What code are you trying to run, do you know which specific line results in the segfault? What's the exact error you are getting? Which docker image are you using? Also, what do you get for

import tesserocr

print(tesserocr.tesseract_version())
print(tesserocr.get_languages())

simonflueckiger avatar Jul 10 '18 18:07 simonflueckiger

@simonflueckiger docker image: python:3.6.5

import tesserocr
pritnt(tesserocr.tesseract_version())
'tesseract 4.0.0-beta.1\n leptonica-1.76.0\n  libgif 5.1.4 : libjpeg 9b : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 1.0.0'
print(tesserocr.get_languages())

this statment stays hanged

image = Image.open('sample.jpg')
print(tesserocr.image_to_text(image))

This gives the error Segmentation fault (core dumped)

ricardomga avatar Jul 10 '18 23:07 ricardomga

@simonflueckiger were you able to replicate the problem?

ricardomga avatar Jul 11 '18 20:07 ricardomga

@ricardomga yes, I am getting the exact same behavior just as you described it. Looking into it ...

simonflueckiger avatar Jul 11 '18 21:07 simonflueckiger

@simonflueckiger thank you

ricardomga avatar Jul 11 '18 22:07 ricardomga

@simonflueckiger sorry to bother, any news?

ricardomga avatar Jul 12 '18 22:07 ricardomga

@simonflueckiger can you please try with tesseract 4.0.0 beta4?

ricardomga avatar Aug 14 '18 09:08 ricardomga

@simonflueckiger can you please try with tesseract 4.0.0 RC1?

ricardomga avatar Oct 01 '18 11:10 ricardomga

I discontinued the Linux builds a while ago (only Windows is supported now). @sirfz can we close this issue? If there still is demand, I could create a PyPI repo and upload the Windows wheels there.

simonflueckiger avatar Jan 19 '23 10:01 simonflueckiger

I'd be happy to include build wheels if someone wants to contribute a CI pipeline that produces relevant wheels. (added "help wanted" tag)

sirfz avatar Mar 13 '23 15:03 sirfz

fyi i tried my hand at this in #318

betaboon avatar Apr 15 '23 18:04 betaboon

@sirfz as the wheels provided by #318 are not yet available on pypi, i would argue this issue should be re-opened.

betaboon avatar Apr 24 '23 12:04 betaboon

Hey @betaboon, I assume the wheels are released on pypi by now. Doesn't that make the README docs outdated since we no longer have to install Tesseract on our system?

winstxnhdw avatar Oct 23 '23 13:10 winstxnhdw