pdf2json

Converts binary PDF to JSON and text, for server-side PDF processing and command-line use.

Results 108 pdf2json issues

* eslint is configured and enabled
* typescript is configured and part of build
* updated pdfparser.d.ts with more types
* previous lib/p2jcmd*.js are replaced with src/cli/p2jcli*.ts
* previous root/pdf2json.js...

Works locally, but get this error when deploying to Lambda with esbuild. Using 3.1.2, but have tried other 3.x.x releases. Same issue. Not really sure what to make of this....

I am new to pdf2json and I really love it. I am using it to read the text of a travel brochure and find some important data. It works fine...

Is there an easy way to process multiple files? I have this: var PDFParser = require('pdf2json'); var pdfParser = new PDFParser(); var fs = require('fs'); var fileNames = []; var...
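One possible approach to the multiple-files question above, as a minimal sketch: give each file its own parser instance and wrap the event callbacks in a promise. The event names `pdfParser_dataReady` and `pdfParser_dataError` and the `loadPDF` method are taken from the pdf2json README; `parseOne`, `parseAll` and the `createParser` factory are hypothetical names introduced here, and the `require('pdf2json')` form follows the snippet in the question (newer releases may need an ES module import instead).

```javascript
// Parse one file with a fresh parser and resolve with its JSON output.
// `createParser` is a hypothetical factory that returns a parser instance.
function parseOne(fileName, createParser) {
  return new Promise((resolve, reject) => {
    const parser = createParser();
    parser.on('pdfParser_dataError', (err) => reject(err));
    parser.on('pdfParser_dataReady', (data) => resolve({ fileName, data }));
    parser.loadPDF(fileName);
  });
}

// Default factory: loads pdf2json lazily, only when it is actually needed.
function defaultParser() {
  const PDFParser = require('pdf2json');
  return new PDFParser();
}

// Parse the files one after another, so only one parse is in flight at a time.
async function parseAll(fileNames, createParser = defaultParser) {
  const results = [];
  for (const fileName of fileNames) {
    results.push(await parseOne(fileName, createParser));
  }
  return results;
}
```

Calling `parseAll(fileNames)` returns a promise for an array of `{ fileName, data }` objects in input order. Processing sequentially trades speed for lower peak memory; the first failing file rejects the whole batch, which may or may not be what you want.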

Just wondering if this is done automatically? Great library by the way.

```
node_modules/pdf3json/lib/pdffont.js:305
    var retVal = encodeURIComponent(str);
                 ^
URIError: URI malformed
    at encodeURIComponent ()
    at cls.flash_encode (/Users/rngadam/coderbunker/prj/EIC/src/eic-cms/eic-cms-cli/node_modules/pdf3json/lib/pdffont.js:305:22)
    at cls.processText (/Users/rngadam/coderbunker/prj/EIC/src/eic-cms/eic-cms-cli/node_modules/pdf3json/lib/pdffont.js:283:25)
    at CanvasRenderingContext2D_.contextPrototype.fillText (/Users/rngadam/coderbunker/prj/EIC/src/eic-cms/eic-cms-cli/node_modules/pdf3json/lib/pdfcanvas.js:204:26)
    at CanvasGraphics_showText [as showText] (eval at (/Users/rngadam/coderbunker/prj/EIC/src/eic-cms/eic-cms-cli/node_modules/pdf3json/lib/pdf.js:60:1), :41251:25)
...
```

Hi All, I have a PDF file that contains about 500 pages (3.6 MB). I can't post it because it contains sensitive data. When I load it up through pdf2json, it...

When a text segment is bold and/or italic, the respective elements on "TS" element are always set to 0 (false). Here's the output: `{ "oc": "#000073", "x": 2.09, "y": 14.46,...
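For readers unfamiliar with the "TS" element mentioned above, here is a small sketch of how its flags could be read. It assumes the layout described in the pdf2json README, `TS = [fontFaceId, fontSize, bold (1/0), italic (1/0)]`, with text runs stored under `R`. `describeRun` is a hypothetical helper, and the sample run is invented for illustration; only `oc`, `x` and `y` come from the output quoted in the report.

```javascript
// Decode the style array of one text run.
// Assumed layout: TS = [fontFaceId, fontSize, bold (1/0), italic (1/0)].
function describeRun(run) {
  const [fontFaceId, fontSize, bold, italic] = run.TS;
  return {
    text: run.T, // left as-is; some pdf2json versions URI-encode this value
    fontFaceId,
    fontSize,
    bold: bold === 1,
    italic: italic === 1,
  };
}

// Hypothetical text block; the R entry is made up for this example.
const sampleText = {
  oc: '#000073',
  x: 2.09,
  y: 14.46,
  R: [{ T: 'Hello', S: -1, TS: [0, 15, 1, 0] }],
};

console.log(sampleText.R.map(describeRun));
```

If the report is accurate, the third and fourth entries of `TS` would come back as 0 even for bold or italic text, so `bold` and `italic` here would both be `false` for the affected runs.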