setup instructions for Windows
Please check the relevant content in the Windows section of the readme.md.
Thanks for this! This doesn't seem to include some dependencies:
ocrmypdftesseract+ language packs- ghostscript
Did you find how to install these?
Thanks,I will continue to study these issues when free time.
Hi, I found the tesseract 5 binary on https://github.com/UB-Mannheim/tesseract/wiki. I am just now installing marker and am waiting for a nvidia graphics card to use CUDA. So have not tested it yet.
or winget install -e --id UB-Mannheim.TesseractOCR (found through the OCRmyPDF page)
Reference in new issue
HI, have you found the perfect solution for windows?
I have updated the README.md, and following the steps, both my friend and I have successfully run the project.
I encountered two issues following the steps detailed in your instructions:
- I encountered the error
ImportError: failed to find libmagic. Check your installationwhich I resolved by running (inpoetry shell)pip install python-magic-bin==0.4.14as described in this StackOverflow answer. - I encountered the error
AssertionError: Torch not compiled with CUDA enabledwhich I resolved by running (inpoetry shell)pip uninstall torchand thenpip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118(note that this will installtorchcomplied for CUDA 11.8) as described by PyTorch's official documentation. ModuleNotFoundError: No module named 'pkg_resources'which I resolved by runningpip install setuptools.
After running these fixes, I was able to get marker working in my Windows environment.
I have created a new PR that addresses the issues raised in my comment above.
Thanks for looking into this! The new version has no required system dependencies, so Windows install should be much simpler - https://github.com/VikParuchuri/marker/pull/116