pdf-to-markdown
CLI tool
It would be nice to be able to use pdf-to-markdown from the command line!
Fwiw, we at Open Government Products have forked the repository and created a CLI and library published on npm. We've changed the transformation logic slightly, however, so some work is needed to safely backport the changes to this repo.
The relevant PRs to backport are: opendocsg#4 opendocsg#7 opendocsg#8 opendocsg#12 opendocsg#15 opendocsg#16 opendocsg#18
Can you provide some small snippet for a basic CLI usage of the modularize branch? It doesn't need to be on npm, I'd be git cloning it, I just cannot figure out how to make it run under node CLI.
I guess Files.test.ts already does something like this under the hood, but it's not clear to me which steps are and aren't necessary in a CLI tool.
Hey @hyperknot, the modularize branch isn't working yet (and I'm currently not active), so there's no way to use it. Please check out https://www.npmjs.com/package/@opendocsg/pdf2md
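For anyone landing here looking for the shape of a minimal wrapper, here is a rough sketch of the read, convert, write steps a CLI would need. It is not taken from either repository: `Converter`, `markdownPathFor` and `convertFile` are hypothetical names, and the converter is passed in as a parameter so the sketch does not depend on any particular package's API.

```typescript
import { promises as fs } from "fs";
import * as path from "path";

// Assumption: a converter is any function that turns PDF bytes into Markdown text.
type Converter = (pdf: Buffer) => Promise<string>;

// Hypothetical helper: derive "report.md" from "report.pdf", keeping the directory.
function markdownPathFor(pdfPath: string): string {
  const parsed = path.parse(pdfPath);
  return path.join(parsed.dir, `${parsed.name}.md`);
}

// Hypothetical helper: read one PDF, convert it, and write the Markdown next to it.
// Returns the path of the file that was written.
async function convertFile(pdfPath: string, convert: Converter): Promise<string> {
  const pdf = await fs.readFile(pdfPath);
  const markdown = await convert(pdf);
  const outPath = markdownPathFor(pdfPath);
  await fs.writeFile(outPath, markdown, "utf8");
  return outPath;
}
```

If I remember the @opendocsg/pdf2md README correctly, its default export takes a PDF buffer and returns a promise of the Markdown string, so it should fit the `Converter` slot, e.g. `convertFile(process.argv[2], pdf2md)`. Please verify that against the package's own documentation before relying on it.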