Menextract2pdf Exported files without text

Open falbarelli opened this issue 5 years ago • 4 comments

Hello, thanks for this useful tool.

I am using Ubuntu and I ran the script "menextract2pdf__overwrite.sh" to overwrite my Mendeley library (with the aim of importing it into Zotero, using its native Mendeley importer).

Unfortunately, none of the PDF files with comments contain any text at all; you can only see the highlighted regions, with no text in them. Is this a known problem?

See one example file here.

Nov 02 '19 18:11 falbarelli

OK that's interesting. I'll have a look. What's the output of the script?

Nov 04 '19 12:11 cycomanic

Hi, sorry for the late reply. I have not had time to play with the script again. However, I ran the actual Python script instead of the bash script for overwriting and it worked fine on the same machine... weird. I'll try again in the next few days and let you know.

Nov 06 '19 09:11 falbarelli

I experienced the same issue. Running either "menextract2pdf__overwrite.sh" or "menextract2pdf.py" with --overwrite produced files with only annotations and none of the original document content. Without any attempt to overwrite (i.e., just running menextract2pdf.py), the process completes as expected. It's easy enough to overwrite manually, so thanks for making this script to take care of the difficult part!

Nov 14 '19 17:11 caufieldjh

I ran into the same problem, too. A brute-force workaround looks like this:

Change line 170

outpdf.write(open(fn_out, "wb"))

into

outpdf.write(open(fn_out+"new.pdf", "wb"))

and delete line 165

return
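A possible explanation for why this workaround helps, which is an assumption and not something confirmed in this thread: if the script still reads the source PDF lazily at the point where it writes, then opening fn_out with "wb" on the same path truncates the file before its pages have been copied. Under that assumption, a less intrusive variant is to write to a temporary file first and only then replace the original. The helper name overwrite_safely below is hypothetical and not part of menextract2pdf; the only thing taken from the script is that outpdf.write accepts an open binary stream.

```python
import os
import tempfile


def overwrite_safely(fn_out, write_func):
    """Write to a temporary file, then replace fn_out with it.

    write_func is any callable that takes a writable binary stream,
    for example outpdf.write from the script (assumed usage).
    """
    # Create the temp file next to the target so the final rename
    # stays on the same filesystem.
    dir_name = os.path.dirname(os.path.abspath(fn_out))
    fd, tmp_path = tempfile.mkstemp(suffix=".pdf", dir=dir_name)
    try:
        with os.fdopen(fd, "wb") as tmp:
            # fn_out is still untouched here, so it can still be read.
            write_func(tmp)
        # Swap the finished file into place only after writing succeeded.
        os.replace(tmp_path, fn_out)
    except BaseException:
        # Leave the original file alone and clean up the partial output.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

With this sketch, line 170 would become overwrite_safely(fn_out, outpdf.write) instead of writing to a "new.pdf" file. On Windows the replace step may fail while the source file is still open, so the reader would need to be closed first.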

Feb 23 '21 07:02 chuchong
