pdf.js-extract issues

Extract images?

3

Would be nice if the lib could extract a list of images (identifier/pathnames by pages) and give a way to extract some of them. Especially useful to see when a...

Zlitus

help wanted

It does not work for remote pdf files? ENOENT error

1

```ts
import { PDFExtract, PDFExtractOptions } from 'pdf.js-extract';

const pdfExtract = new PDFExtract();
const options: PDFExtractOptions = {};

async function main() {
  const res = await pdfExtract.extract('https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf', options);
}
// ...
```

DavideViolante
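
One possible workaround, offered as a sketch rather than a confirmed fix: an ENOENT error suggests the URL is being treated as a local file path, so the file can be downloaded first and its bytes passed to the buffer-based entry point (`extractBuffer`, mentioned in another issue in this list). The helper name `extractRemotePdf` and the `Fetcher` and `BufferExtractor` types are hypothetical and exist only for illustration; the fetcher and the extractor are passed in so the helper does not depend on any particular runtime.

```ts
// Hypothetical types for illustration; neither is part of pdf.js-extract.
type Fetcher = (url: string) => Promise<{
  ok: boolean;
  status: number;
  arrayBuffer(): Promise<ArrayBuffer>;
}>;
type BufferExtractor<T> = (data: Uint8Array) => Promise<T>;

// Download a remote PDF, then hand its bytes to a buffer-based extractor.
async function extractRemotePdf<T>(
  url: string,
  fetcher: Fetcher,
  extractFromBuffer: BufferExtractor<T>
): Promise<T> {
  const response = await fetcher(url);
  if (!response.ok) {
    throw new Error(`Download failed with HTTP ${response.status}`);
  }
  const bytes = new Uint8Array(await response.arrayBuffer());
  return extractFromBuffer(bytes);
}

// Possible usage on Node 18+ (assumes the global fetch and that
// extractBuffer returns a promise when no callback is given):
//
//   const data = await extractRemotePdf(
//     'https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf',
//     (u) => fetch(u),
//     (bytes) => pdfExtract.extractBuffer(Buffer.from(bytes), options)
//   );
```

Passing the fetcher and extractor as arguments also makes it easy to substitute a stub when testing without network access.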

Is there a way to get the fill color for text?

Basically the title. I am using TypeScript, so maybe the types are not up to date, but I don't see a way to get the color information for a text item.

isaacfink

Fix for Y coordinate with new version of pdfjs

4

Hi @ffalt thank you a lot for this project. I have successfully been using your `extractBuffer` function in a browser environment. Working with pdfjs-dist V4.0.269 I noticed that the y...

MCMattia

Get Coordinates of Each word.

1

Hi, is it possible to get the coordinates of each word in the PDF? The output for "Hello, world!" is a single chunk of words; I want to extract each word as one separate...

Ana0112
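
A rough approximation, not a feature of the library: if each extracted text item carries `x`, `y`, `width`, `height` and `str` (assumed here to match the item shape pdf.js-extract returns), a chunk can be split on whitespace and each word given a share of the chunk's width proportional to its character offset. The `TextItem` and `WordBox` interfaces and the `splitItemIntoWords` name are hypothetical. The sketch assumes horizontal left-to-right text and treats every character as equally wide, so the resulting boxes are estimates only.

```ts
// Assumed shape of one extracted text chunk (illustrative subset of fields).
interface TextItem {
  x: number;
  y: number;
  str: string;
  width: number;
  height: number;
}

// Hypothetical result type: one estimated box per word.
interface WordBox {
  word: string;
  x: number;
  y: number;
  width: number;
  height: number;
}

// Split a chunk into words and estimate each word's box by assuming
// that every character occupies the same share of the chunk's width.
function splitItemIntoWords(item: TextItem): WordBox[] {
  const totalChars = item.str.length;
  if (totalChars === 0) {
    return [];
  }
  const perChar = item.width / totalChars;
  const words: WordBox[] = [];
  const pattern = /\S+/g;
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(item.str)) !== null) {
    words.push({
      word: match[0],
      x: item.x + match.index * perChar, // offset by preceding characters
      y: item.y,
      width: match[0].length * perChar,
      height: item.height,
    });
  }
  return words;
}
```

For proportional fonts the estimate drifts; measuring glyph widths from the font data would be more accurate but is beyond this sketch.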