pdf-to-markdown
Is there any option to convert PDF to Markdown with embedded images?
Hi! This version can only convert to text. Is there any way or option to convert with media (images, etc.)?
Thanks
Currently not, sorry. I've thought about it, and I think pdf.js allows extracting media, but I haven't tried it, and for my use case it was irrelevant.
Table data would be great too
@jzillmann Would you accept a bounty for this feature?
@berserkwarwolf What exactly?
- You want media extracted?
- You want media extracted and included in the Markdown as links (probably downloaded as a folder)?
- Table data?
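To make the second option concrete, here is a minimal sketch of what "included in the Markdown as links" could look like once the images have been extracted into a folder. It is not part of pdf-to-markdown; the function name `markdown_image_links`, the `media_dir` parameter, and the example file names are all hypothetical, and it assumes file names without spaces or parentheses.

```python
from pathlib import PurePosixPath


def markdown_image_links(image_names, media_dir="media"):
    """Build Markdown image links for files already extracted into media_dir.

    Hypothetical helper: assumes the images sit in a folder next to the
    Markdown file and that their names contain no spaces or parentheses.
    """
    lines = []
    for name in image_names:
        alt = PurePosixPath(name).stem             # file name without extension as alt text
        target = PurePosixPath(media_dir) / name   # relative link, forward slashes
        lines.append(f"![{alt}]({target})")
    return "\n".join(lines)


# Example file names are made up for illustration.
print(markdown_image_links(["report-1.png", "report-2.jpg"], "report_media"))
```

Relative links keep the Markdown file and its media folder portable as a pair, which matches the "downloaded as a folder" idea.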
https://pdfbox.apache.org/
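REM Save the java line below as a batch file named extract, then run it on every PDF in the current folder with:
REM for %f in (*.pdf) do extract "%f"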
java -jar pdfbox-app-2.0.24.jar ExtractImages %1
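For anyone not on Windows, the same loop can be sketched in Python. This is only an illustration of the batch lines above: it assumes `java` is on the PATH and that the PDFBox app jar (same file name as above) is in the working directory. The helper names `build_command` and `extract_all` are made up for this example.

```python
import subprocess
from pathlib import Path

# Jar name taken from the command above; adjust to the version you downloaded.
PDFBOX_JAR = "pdfbox-app-2.0.24.jar"


def build_command(pdf_path, jar=PDFBOX_JAR):
    """Return the PDFBox ExtractImages command line for a single PDF."""
    return ["java", "-jar", str(jar), "ExtractImages", str(pdf_path)]


def extract_all(folder=".", jar=PDFBOX_JAR):
    """Run ExtractImages on every *.pdf in folder (assumes java is on PATH)."""
    pdfs = sorted(Path(folder).glob("*.pdf"))
    for pdf in pdfs:
        # check=True stops at the first PDF that PDFBox fails on
        subprocess.run(build_command(pdf, jar), check=True)
    return pdfs
```

Calling `extract_all()` from the folder that holds the PDFs mirrors the `for %f in (*.pdf)` loop. Passing the arguments as a list avoids quoting problems with file names that contain spaces.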