pdf2json

Parser not handling some spaces between words

Open simaogf opened this issue 7 years ago • 3 comments

Hello! First of all, congratulations on the good work. However, I found an annoying little issue: sometimes the parser does not handle the space between words. This happens when two words come in two different JSON objects. In the example below, the words "survey" and "published" will be interpreted as one word. Any thoughts? Thanks in advance.

    {
      "x": 9.672, "y": 13.2, "w": 2.529, "sw": 0.397125, "clr": 0, "A": "left",
      "R": [ { "T": "survey", "S": -1, "TS": [ 0, 12.48, 0, 0 ] } ]
    },
    {
      "x": 11.457, "y": 13.2, "w": 38.245, "sw": 0.397125, "clr": 0, "A": "left",
      "R": [ { "T": "published%20by", "S": -1, "TS": [ 0, 12.48, 0, 0 ] } ]
    }

Cheers.
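One possible workaround on the consumer side, sketched here as an illustration rather than as anything pdf2json provides: treat every text object as its own word boundary and insert a space when joining objects that share the same `y`. The names `blockText`, `joinTextBlocks`, and `yTolerance` are hypothetical, and the sketch assumes the objects arrive in reading order and that a single word is never split across two objects, which may not hold for every PDF.

```javascript
// Hypothetical helpers, not part of pdf2json.

// Concatenate and URI-decode all runs ("R") of one text object.
function blockText(block) {
  return block.R.map((run) => decodeURIComponent(run.T)).join("");
}

// Join text objects into lines. Objects whose "y" differs by at most
// yTolerance from the previous object are treated as the same line.
// Assumption: blocks are already in reading order.
function joinTextBlocks(blocks, yTolerance = 0.01) {
  const lines = [];
  let lastY = null;
  for (const block of blocks) {
    const text = blockText(block);
    const sameLine = lastY !== null && Math.abs(block.y - lastY) <= yTolerance;
    if (sameLine) {
      const current = lines[lines.length - 1];
      // Add a space only if neither side already has whitespace at the seam.
      const needsSpace =
        current !== "" && text !== "" && !/\s$/.test(current) && !/^\s/.test(text);
      lines[lines.length - 1] = current + (needsSpace ? " " : "") + text;
    } else {
      lines.push(text);
    }
    lastY = block.y;
  }
  return lines;
}

// The two objects from the report above.
const sample = [
  { x: 9.672, y: 13.2, w: 2.529, R: [{ T: "survey", S: -1, TS: [0, 12.48, 0, 0] }] },
  { x: 11.457, y: 13.2, w: 38.245, R: [{ T: "published%20by", S: -1, TS: [0, 12.48, 0, 0] }] },
];

console.log(joinTextBlocks(sample)); // [ 'survey published by' ]
```

A gap-based check using `x` and `w` would be stricter, but the example alone does not make clear how `w` relates to `x`, so the sketch does not rely on it.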

simaogf · Oct 11 '16 10:10