simdjson-java Lazy Parsing

Lazy Parsing

Open zekronium opened this issue 8 months ago • 9 comments

Hi,

My understanding that the key benefit of SIMD is that we can "progressively" parse a stream of JSON hence the tape reading like implementation.

Although in benchmarks with varying stop points, as in parsing only 20/50/80% of JSON and exiting early, the throughput seems to be almost the same and directly correlating to the size of the json.

If I pre-pad the array like what it does, then I get a more realistic result with varying throughput depending how deep the parsing goes but the general throughput still stays roughly the same. Is there alot of pre-parsing going on? Screenshot 2023-12-15 at 16 31 07

The bars are different size of json

Dec 16 '23 00:12 zekronium

simdjson-java simdjson-java copied to clipboard

Lazy Parsing

simdjson-java
simdjson-java copied to clipboard