pdf.js-extract
pdf.js-extract copied to clipboard
Extract images?
Would be nice if the lib could extract a list of images (identifier/pathnames by pages) and give a way to extract some of them. Especially useful to see when a pdf has 0 text but many images, then a OCR work aside can be started.
Thank you for your work 👍.
Another repo pdf-lib may helps,is there any repo could extract both text and image conveniently?
my greate author, i want know the lib could extract a list of images , had ok? i want this function。thanks
Sad, that the image stream is not in the data object at all.
Otherwise great lib!