pdf2json icon indicating copy to clipboard operation
pdf2json copied to clipboard

Parser not handling some spaces between words

Open simaogf opened this issue 8 years ago • 3 comments

Hello! First of all, congratulations for the good work. However, I found a boring little issue: sometimes the parser does not handle the space between words. This happens when two words come in 2 different json objects. In the example the words "survey" and "published" will be interpretated as one. Any thoughts? Thanks in advance. { "x": 9.672, "y": 13.2, "w": 2.529, "sw": 0.397125, "clr": 0, "A": "left", "R": [ { "T": "survey", "S": -1, "TS": [ 0, 12.48, 0, 0 ] } ] }, { "x": 11.457, "y": 13.2, "w": 38.245, "sw": 0.397125, "clr": 0, "A": "left", "R": [ { "T": "published%20by", "S": -1, "TS": [ 0, 12.48, 0, 0 ] } ] } Cheers.

simaogf avatar Oct 11 '16 10:10 simaogf

I have same issue have you find the answer ? @fernandesSimao

tarun4all avatar Aug 19 '19 09:08 tarun4all

I'm seeing this as well.

dselman avatar Aug 29 '20 15:08 dselman

I also have the same issue. I'm using the latest version 2.1.0

AnjaneyuluBatta505 avatar May 24 '22 07:05 AnjaneyuluBatta505