Vik Paruchuri

Results 136 comments of Vik Paruchuri

I honestly haven't looked into PDF format versions, so I don't know if that is it. That's an interesting find, though. It may also be related to whether the data...

I'm currently working on a better way to detect columns that might be useful for this

This should be fixed in the new version (coming in the next couple of weeks).

Try the dev branch if you're having issues - there is better ordering implemented there, but still have to test more before merging.

Thank you for this, @gardner ! Is there any easy way to get this to work with or without a cuda device? I imagine it would need a different base...

Will merge a demo gradio app as well shortly

Try the new code/model - `pip install -U surya`

This seems to work and You may need to experiment with the threshold settings to detect more text (see README)

Thanks for this! This doesn't seem to include some dependencies: - `ocrmypdf` - `tesseract` + language packs - ghostscript Did you find how to install these?

Thanks for looking into this! The new version has no required system dependencies, so Windows install should be much simpler - https://github.com/VikParuchuri/marker/pull/116