pdf.js-extract icon indicating copy to clipboard operation
pdf.js-extract copied to clipboard

Extract images?

Open Zlitus opened this issue 4 years ago • 3 comments

Would be nice if the lib could extract a list of images (identifier/pathnames by pages) and give a way to extract some of them. Especially useful to see when a pdf has 0 text but many images, then a OCR work aside can be started.

Thank you for your work 👍.

Zlitus avatar Jun 22 '20 11:06 Zlitus

Another repo pdf-lib may helps,is there any repo could extract both text and image conveniently?

shartoo avatar Sep 16 '21 00:09 shartoo

my greate author, i want know the lib could extract a list of images , had ok? i want this function。thanks

programmerWhite avatar May 05 '22 08:05 programmerWhite

Sad, that the image stream is not in the data object at all.

Otherwise great lib!

Hellsfoul avatar Mar 03 '23 08:03 Hellsfoul