pdf2json
pdf2json copied to clipboard
complete pdf content into json format
I am using this module to get complete content from pdf in json format but only fields save in json format without content i am facing this issue , any one can help ?
{ "formImage": { "Transcoder": "[email protected] [https://github.com/modesty/pdf2json]", "Agency": "", "Id": { "AgencyId": "", "Name": "", "MC": false, "Max": 1, "Parent": "" }, "Pages": [ { "Height": 49.5, "HLines": [], "VLines": [], "Fills": [ { "x": 0, "y": 0, "w": 0, "h": 0, "clr": 1 } ], "Texts": [], "Fields": [], "Boxsets": [] }, { "Height": 63, "HLines": [], "VLines": [], "Fills": [ { "x": 0, "y": 0, "w": 0, "h": 0, "clr": 1 } ], "Texts": [], "Fields": [], "Boxsets": [] } ], "Width": 38.25 } }
you can try pdftotext such as poppler-utils provided to deal with your input pdf, perhaps pdf file is just broken some way for pdf.js to extract