stream-lib icon indicating copy to clipboard operation
stream-lib copied to clipboard

Stream summarizer and cardinality estimator.

Results 35 stream-lib issues
Sort by recently updated
recently updated
newest added

In my live case, we need BloomFilter for a bigger amount (about 4-5Gb ram, >32B bits) The code is dependent on java.util.BitSet with .ctor `public BitSet(int nbits) ` with a...

We have to store a lot CountMinSketches in heap and we don't need the `long` values in there. Therefore we created a new one that just uses `short` values. It's...

The assumption is that the id is unique. If the id value overflows, it can lead to NullPointerExceptions. Changing the type to long doesn't make overflows impossible, but highly unlikely....

The current implementation has Long type for the internal table. For a lot of real use cases, we need only Int (in general it would be great to have a...

I have had very strange results (very high inaccuracy) for low-cardinality HLL++ when using usual values of p (p = 11, 12 ,13, 14) and sp = 32. I suspect...

We add maven-bundle-plugin to package it as a bundle, as effect to add in MANIFEST.MF osgi metadata. It enables it to be used inside OSGI based applications.

line 91 : method, long com.clearspring.analytics.stream.SampleSet.put(T element, int incrementCount),I think `count++` must be `count+=incrementCount`

There are several classes which require sizing or tuning parameters on construction (for example, HyperLogLog has got `rsd` or `log2m` parameters, CountMinSketch has got `depth`, `width`, and `seed`). There's no...

I just want to add getters on Depth and Width attributes in Count-Min Sketch. It will allow to optimize serialization by pre-computing the number of bytes required in the process....