pdfminer.six
                                
                                
                                
                                    pdfminer.six copied to clipboard
                            
                            
                            
                        Community maintained fork of pdfminer - we fathom PDF
Split of from #497 by @typhoon71 Now that the AcroForm howto is merged, I suppose I can consider some other cases. I waited after the PR got merged because it...
Unfortunately I cannot include the PDF as it is a bank statement, but hopefully the details below are enough. The error is as follows: ```shell DEBUG:pdfminer.pdfdocument:trailer={'Size': 70, 'Root': , 'Info':...
Now that the AcroForm howto is merged, I suppose I can consider some other cases. I waited after the PR got merged because it was becaming difficult for me to...
It took me hours with a debugger to figure out why the result of conversion to text was just running all the words together into one very long string! Then...
The learning curve for pdf's can be quite steep. It would help new contributors (like me) if there are some recommended readings or references. I suggest we put those in...
It would be nice to measure the execution time of all tests that parse a specific pdf file. With this in place it is easy to check if PR's increase/decrease...
The first release in the CHANGELOG.md is 20181108. We could add information about the releases before that.
**Is your feature request related to a problem? Please describe.** No, is not a problem **Describe the solution you'd like** My PDF generates around 600KB XML and most of it...
Migrating from the original pdfminer should be easy, i.e. it has the same api. But this is not mentioned anywhere in the docs.
**Feature request** When using html output the positioning of the characters can be off, because the displayed font is different from the one that is specified. E.g. see #409. This...