pdf.js-extract Extract images?

Open Zlitus opened this issue 4 years ago • 3 comments

It would be nice if the lib could extract a list of images (identifiers/pathnames by page) and provide a way to extract some of them. This would be especially useful for detecting when a PDF has no text but many images, so that a separate OCR pass can be started.
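Until images are exposed, the "no text" half of that check can be approximated from the text data alone. Below is a minimal sketch, assuming the result object has the shape shown in the pdf.js-extract README (`pages[].pageInfo.num` and `pages[].content[].str`). `findTextlessPages` is a hypothetical helper, not part of the library, and it cannot confirm that a page actually contains images.

```javascript
// Hypothetical helper, not part of pdf.js-extract.
// Assumes the extract result looks like:
//   { pages: [{ pageInfo: { num }, content: [{ str }] }] }
// Returns the numbers of pages that have no non-blank text items.
function findTextlessPages(data) {
  return data.pages
    .filter((page) => {
      const items = page.content || [];
      // A page counts as "textless" if no item has visible characters.
      return !items.some((item) => item.str && item.str.trim() !== '');
    })
    .map((page) => page.pageInfo.num);
}

// Hand-written sample data standing in for a real extract result.
const sample = {
  pages: [
    { pageInfo: { num: 1 }, content: [{ str: 'Hello' }] },
    { pageInfo: { num: 2 }, content: [] },
    { pageInfo: { num: 3 }, content: [{ str: '   ' }] },
  ],
};

console.log(findTextlessPages(sample)); // [ 2, 3 ]
```

With the real library, the same function could be called on the `data` argument passed to the callback of `new PDFExtract().extract(file, {}, (err, data) => ...)`. Pages it reports are only candidates for OCR, since an empty page and a scanned page look the same from the text side.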

Thank you for your work 👍.

Jun 22 '20 11:06 Zlitus

Another repo, pdf-lib, may help. Is there any repo that can extract both text and images conveniently?

Sep 16 '21 00:09 shartoo

Dear author, I would like to know whether the lib can extract a list of images. Is that supported yet? I would like to have this feature. Thanks.

May 05 '22 08:05 programmerWhite

Sad that the image stream is not in the data object at all.

Otherwise great lib!

Mar 03 '23 08:03 Hellsfoul
