Daniel Lemire comments

Results 1863 comments of


                                            Daniel Lemire

Adding `is_base64(const char*, size_t)` API

@anonrig That's a great idea. I wonder if we couldn't do something better: validate and return the exact size of the inputs (minus white spaces). Do you have an application...

Adding `is_base64(const char*, size_t)` API

@anonrig Oh! So you actually want to decode base64 data that has been percent encoded? So spaces are likely not relevant in this context because you'd probably not do percent...

Adding `is_base64(const char*, size_t)` API

> I think improving Ada's performance might be a better place than adding it to simdutf Granted. This being said, transcoding with replacement might be even more critical. 🚤

add multithreading for large inputs?

@ronag How large are you thinking about? It takes thousands on nanoseconds to start a thread. Up to, say, 200,000 ns on some systems (it varies greatly). And you haven't...

add multithreading for large inputs?

@ronag > I'm thinking this might make sense if the overhead is negligible at sizes of ~128k. I agree and it answers my question (*How large are you thinking about?*)....

selectFrom and rearrange (vectorized lookup tables) are going to be slow

@arouel I definitively think that it is worth following up... especially as new Java releases come around.

add pdep/pext compress implementation

We do not currently target BMI2, see this line... https://github.com/simdjson/simdjson/blob/e341c8b43861b43de29c48ab65f292d997096953/include/simdjson/haswell/begin.h#L7 We do not do so because we don't use BMI2 in the haswell kernel. Note that if we are going...