pdf2json
pdf2json copied to clipboard
Parser not handling some spaces between words
Hello! First of all, congratulations for the good work. However, I found a boring little issue: sometimes the parser does not handle the space between words. This happens when two words come in 2 different json objects.
In the example the words "survey" and "published" will be interpretated as one. Any thoughts?
Thanks in advance.
{ "x": 9.672, "y": 13.2, "w": 2.529, "sw": 0.397125, "clr": 0, "A": "left", "R": [ { "T": "survey", "S": -1, "TS": [ 0, 12.48, 0, 0 ] } ] }, { "x": 11.457, "y": 13.2, "w": 38.245, "sw": 0.397125, "clr": 0, "A": "left", "R": [ { "T": "published%20by", "S": -1, "TS": [ 0, 12.48, 0, 0 ] } ] }
Cheers.
I have same issue have you find the answer ? @fernandesSimao
I'm seeing this as well.
I also have the same issue. I'm using the latest version 2.1.0