environment:
pdfquery=0.4.3
python=2.7.15
I used pdfquery to load this PDF and hit an error.
The traceback is as follows:
    pdf.load()
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/pdfquery/pdfquery.py", line 385, in load
    self.tree = self.get_tree(*_flatten(page_numbers))
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/pdfquery/pdfquery.py", line 487, in get_tree
    for n, page in pages:
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/pdfquery/pdfquery.py", line 608, in <genexpr>
    return (self.get_layout(page) for page in self._cached_pages())
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/pdfquery/pdfquery.py", line 603, in get_layout
    layout = self._add_annots(layout, page.annots)
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/pdfquery/pdfquery.py", line 655, in _add_annots
    elem = parser.makeelement('Annot', annot)
  File "src/lxml/parser.pxi", line 1004, in lxml.etree._BaseParser.makeelement
  File "src/lxml/apihelpers.pxi", line 131, in lxml.etree._makeElement
  File "src/lxml/apihelpers.pxi", line 119, in lxml.etree._makeElement
  File "src/lxml/apihelpers.pxi", line 300, in lxml.etree._initNodeAttributes
  File "src/lxml/apihelpers.pxi", line 311, in lxml.etree._addAttributeToNode
  File "src/lxml/apihelpers.pxi", line 1439, in lxml.etree._utf8
ValueError: All strings must be XML compatible: Unicode or ASCII, no NULL bytes or control characters
I'm having the same issue. Any update?
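Not an official fix, but the traceback suggests that one of the annotation values passed to `parser.makeelement('Annot', annot)` contains a NULL byte or another control character, which lxml refuses to store as an XML attribute. A possible workaround is to strip those characters from the values first. The sketch below only shows the cleaning step in isolation. The helper names `clean_xml_text` and `clean_attributes` are made up for illustration and are not part of pdfquery or lxml, and the sample `annot` dict is invented. It also assumes every value is already a text or byte string.

```python
import re

# Control characters that XML 1.0 does not allow in text or attributes.
# Tab (\x09), newline (\x0a) and carriage return (\x0d) are legal and kept.
_CONTROL_CHARS = re.compile(r'[\x00-\x08\x0b\x0c\x0e-\x1f]')


def clean_xml_text(value):
    """Return value as text with NULL bytes and control characters removed.

    Hypothetical helper: assumes value is a text or byte string.
    """
    if isinstance(value, bytes):
        # Assumption: treat raw bytes as UTF-8, replacing undecodable bytes.
        value = value.decode('utf-8', 'replace')
    return _CONTROL_CHARS.sub(u'', value)


def clean_attributes(attrs):
    """Clean every value of an attribute dict (hypothetical helper)."""
    return {key: clean_xml_text(val) for key, val in attrs.items()}


# Invented sample data standing in for a PDF annotation dictionary.
annot = {'Subtype': u'Link', 'Contents': u'note\x00 with\x08 junk'}
cleaned = clean_attributes(annot)
print(cleaned['Contents'])
```

If this is indeed the cause, applying something like `clean_attributes(annot)` right before the `makeelement` call in `_add_annots` might avoid the `ValueError`, at the cost of silently dropping the offending characters. I have not verified this against the PDF from the original report.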