pdf2json icon indicating copy to clipboard operation
pdf2json copied to clipboard

converts binary PDF to JSON and text, for server-side PDF processing and command-line use.

Results 108 pdf2json issues
Sort by recently updated
recently updated
newest added

Why is the default module suddenly 1.75MB and has no default export?

Hi I am a newbie to React and Node.js. I am using the PDF2JSON module as part of my development. When I build the application I get the below error....

-small updates ---removed some test functions ---commonjs easyparser.js ---- thanks

when trying to call this function `this.PDFParser.parseBuffer(PDF)` server got stuck. even setTimeout is not working and not getting api call. overall server freeze and I had to restart server. Appreciate...

I'm experiencing some broken text blocks problems with the latest version of the library. The pdf looks like this and this is what I'm getting, as an example: `{ "x":...

Number from my pdf file are coming as null. These numbers sytle is showing -1. After conversion to JSON , I have this. {"x":6.078,"y":3.2969999999999997,"w":5.6,"oc":"#747474","sw":0.3450625,"A":"left","R":[{"T":"**%00"**,"S":-1,"TS":[2,13,0,0]}]}],"Fields":[],"Boxsets":[]}]} Please guide how to solve this.

[ill.pdf](https://github.com/modesty/pdf2json/files/14447093/ill.pdf) ![image](https://github.com/modesty/pdf2json/assets/130582247/b4a7ad54-798d-41d3-99e7-f5ac200df8c7) pdf-testfile: Minimal set with UTF-8 characters encoded [ill01.pdf](https://github.com/py-pdf/pypdf/files/14459161/ill01.pdf) [ill00.pdf](https://github.com/py-pdf/pypdf/files/14459246/ill00.pdf) to start the extract of text, use: `node pdf2json -cvf ill01.pdf` **I expect true character mappings if there are...

Dear all, I like to know how to detect the hLines correctly. - Which module does it? - How to improve it? See the picture: No HLine deteced. The Fille...

Hello, recently I found bug with parsing some .pdf files. If file have symbols colored in semi-transparent color, a program just stops without any error messages. Here's code which stops...

So I have a PDF with just one field on it - a field named "xxx.yyy". When I run pdf2json 3.0.5 on the PDF I'm told that the only field...