datasketches-cpp
datasketches-cpp copied to clipboard
Core C++ Sketch Library
Trying to build python gives errors. Fixed by including in the wrapper itself, but our files shouldn't be missing needed headers.
Compare the performance of t-Digest with the closest competitor in the library, REQ sketch. REQ sketch is the closest competitor because it prioritizes high rank accuracy (HRA mode) or low...
Hi there, We are consider using datasketches in our C++ environment. The current organization, where there a bunch of folders, each with an `include/` directory, is a bit awkward for...
Determinism
Hi, I am interested in using this amazing project for a distributed machine learning application. My only blocker to doing this is the ability to reproduce results for a given...
Hello, There's a consideration at [XGBoost](https://github.com/dmlc/xgboost/issues/5746) about potentially using the KLL sketch to represent feature value histograms. One potential blocker is the need for a weighted version of the sketch,...
## Proposal: Add DDSketch (Relative-Error Quantile Sketch) **Summary:** This issue proposes adding an implementation of [DDSketch](https://www.vldb.org/pvldb/vol12/p2195-masson.pdf), a mergeable quantile sketch with relative-error guarantees, to the `datasketches-cpp` library. Benefits: - Relative-error...
DDSketch
## Description This PR implements **DDSketch** and closes #457. The design follows the core ideas of the original DDSketch paper and the implementations in other libraries. I have aimed to...
Hi DataSketches team, I’m working with the HLL sketch family (HLL4, HLL6, HLL8) and need a clear understanding of the serialized binary layout, including: - header structure - Register arrays...
Description When compiling code that includes and uses the hll_sketch class with the Clang compiler, a -Wunnecessary-virtual-specifier warning is generated. The warning points to the destructor of the hll_sketch_alloc class,...