pdfparser icon indicating copy to clipboard operation
pdfparser copied to clipboard

Python binding to libpoppler with focus on text extraction

Results 6 pdfparser issues
Sort by recently updated
recently updated
newest added

I am trying to install a tool that makes of use of a version of pdfparser from some years ago. The developer has advised me to install the newest version...

[schutzer-2010-eurjclinpharmacol-v66.pdf](https://github.com/izderadicka/pdfparser/files/3893512/schutzer-2010-eurjclinpharmacol-v66.pdf) only 2] is detected but no other blue characters were detected their rgb values are 0.7,0,7,0,7. How do i detect correct color of text

bug

Added support for extraction of font boolean atrributes like bold and italic (from textfontinfo class). Note that experiments revealed that these attributes will surely be True positive but can be...

import pdfparser.poppler as pdf pdf.Document(r"Manish.pdf") Traceback (most recent call last): File "", line 1, in File "pdfparser/poppler.pyx", line 116, in pdfparser.poppler.Document.__cinit__ TypeError: expected bytes, str found Its in redhat Can...

enhancement
Python 3

I want to install it on windows. Is it possible?

enhancement
question

Hi, I'm not great at c/c++ but I've been reading the poppler source trying learn how things work. Do you know of any way to give the pdf to poppler...

enhancement
question