Results 126 comments of Michael Eisel

I am curious how the devs of the other libs built on simdjson feel about this, and I still don't know the real-world occurrence rate of non-ASCII JSON. From there,...

Makes sense, and yes I mean keys. I have no strong feelings about it either way, since my library already has a handler for it

Although I will say one thing... if you want to canonicalize after the tape has been written, it's sort of a pain. You can canonicalize where currently we call `strcmp`,...

what if the API allowed the user to pass in a function pointer for the comparison. it would cover the case-insensitive use-case, as well as this use case

by raw utf-8, you mean that they don't use `\u...`? if so, the issue is still that the same unicode character, i.e. grapheme cluster, can be represented by different code...

If you're referring to #1001, a buffer may solve it for some, but it creates a trade-off between efficiency and durability. mmap does not. Although I do see a lot...

To be honest, I wonder if #1001 should just be reverted. It is more complicated, both for the consumers as well as library developers, and less durable, than mmap. What...

As in, #1001 reverted alongside the addition of mmap

The top answer of this question shows how to both and read and write using mmap (although we only need to write). Once you’ve set everything up, you can use...