Parsr
Parsr copied to clipboard
Transforms PDF, Documents and Images into Enriched Structured Data
**Summary** When I tried to upload a simple Image PDF to Parsr. I am getting blank output with default parameters **Steps To Reproduce** Steps to reproduce the behavior: Input: [1stpage.pdf](https://github.com/axa-group/Parsr/files/8007597/1stpage.pdf)...
**Summary** Processing a PDF I see an error in the logs **Steps To Reproduce** just upload an PDF which is an image and set the OCR on **Expected behavior** A...
**Is your feature request related to a problem? Please describe.** It could be nice to be able to use the cloud version of Abbyy **Describe the solution you'd like** Just...
Is possible to run file processing in paralell or at least faster somehow? Parsr needs 20 minutes for converting file of size around 30mb.
Hello, thank you for developing this library, I found a strange PDF, which blocks parsr at 100%CPU without any progress in hours... PDF has only 1 page. trying to debug...
To reproduce: Run the V1.1.0 Docker image and try to extract tables with TableDetection2 enabled. ```bash parsr_1 | File "/usr/local/lib/python3.7/dist-packages/tabula/io.py", line 85, in _run parsr_1 | check=True, parsr_1 | File...
**Summary** Docker image build fails on copying `policy.xml` to `imagemagick` directory. **Steps To Reproduce** 1. Get clean repo 2. run `npm build` 3. run `docker-compose -f docker-compose-build.yml build` 4. Get...
the text `5 April 2009.` at the end of a sentence where `2009.` is on the next line breaks list-detection marking the line as item number 10 in the list...
**Is your feature request related to a problem? Please describe.** You can retrieve the markdown, CSV and Text results via the API. I personally find that very convenient for "simple"...
**Is your feature request related to a problem? Please describe.** There is no problem. **Describe the solution you'd like** Provide a jsond or any equivalent format to understand the datamodel...