Parsr
Parsr copied to clipboard
Transforms PDF, Documents and Images into Enriched Structured Data
Is it possible to parse epub using pandoc?
**Summary** GUI is blank except for the hearder. Same bug as #476 but the proposed solution didn't work. Using past versions leads to the same bug. **Steps To Reproduce** ```...
**Is your feature request related to a problem? Please describe.** - This feature will be really useful for the following scenario in a production application: - Connect to a document...
**Summary** I would like to read both the document content in the SimpleJSON format and the document metadata collection . **The problem** I'm frustrated to see that the SimpleJSON export...
Looks like this is just a typo. Call to __text_objects_none_page() is missing the 's': https://github.com/axa-group/Parsr/blob/34c3622abeae9075a8468a9cfc8530aea4eeed3b/clients/python-client/parsr_client/parsr_output_interpreter.py#L60
**Version** parsr-client==3.2.3 **Issue** the returned dictionary when using 'wait_till_finished=True' was: ```python return { 'file': file, 'config': config, 'status_code': r.status_code, 'server_response': r.text } ``` It should be: ```python return { 'file':...
**Summary** The bare metal installation instructions are out of date and incomplete. **Additional context** The bare metal installation instructions on the page `docs/installation.md` needs to be updated. The issues I...
**Summary** On MacOS 11.6.4, using Parsr 1.2.2, the `camelot` package cannot find the Ghostscript library installed by macports. I have good reason to believe that homebrew has the same problem....
**Is your feature request related to a problem? Please describe.** It appears that all `gs` usage was removed in 2020. However, ghostscript is still installed, e.g., in `Dockerfile`. This results...
**Summary** Processing a document throws this error `No module named 'sklearn.feature_selection.rfe'.` loading levels_model.pkl in HeadingLevelPrediction.py is causing the error