Daniel Lemire
Daniel Lemire
Currently, the code is only tested on little-endian hardware. We would need to run tests and to flip byte order (to preserve interoperability) on big-endian hardware.
The current lookup tables are quite large. Finding a way to substantially reduce their memory usage without adversally affecting performance would be a worthy goal.
Some look-ups could be efficiently replaced by fast instructions such as a [``pdep``](https://software.intel.com/en-us/node/523995) followed by a multiplication and a shift. It is unlikely to be generally faster than a look-up,...
Following this PR https://github.com/lemire/streamvbyte/pull/26 we now have code that can use a 0,1,2,4 byte encoding. However, it is basically achieved through pure code duplication. Worse: it does not benefit from...
The generic codec supports both x64 and ARM NEON, however the differential-encoded version is x64 only. It seems like it would be easy to port them over. The Delta function...
See http://bitmagic.io/dna-search.html
Generally, CRoaring does far too many heap allocations. This slows down the code in some instances, and it can make it more fragile. In many instances, especially when computing intersections,...
This verifies that Xcode 10 is not sufficient.