tdigest

t-Digest data structure in Python. Useful for percentiles and quantiles, including in distributed environments like PySpark.

Results 15 tdigest issues

`Traceback (most recent call last): File "", line 2, in File "/usr/local/lib/python2.7/dist-packages/tdigest/tdigest.py", line 112, in update self._add_centroid(Centroid(x, w)) File "/usr/local/lib/python2.7/dist-packages/tdigest/tdigest.py", line 67, in _add_centroid self.C.insert(centroid.mean, centroid) File "accumulation_tree/accumulation_tree.pyx", line 233,...

https://github.com/tdunning/t-digest/blob/master/docs/t-digest-paper/histo.pdf

I have [my_set.zip](https://github.com/CamDavidsonPilon/tdigest/files/2613099/my_set.zip). When I'm using the Java code: ```java import com.tdunning.math.stats.TDigest; import org.apache.commons.csv.CSVFormat; import org.apache.commons.csv.CSVRecord; import java.io.*; import java.util.stream.StreamSupport; public class TDigestTry { public static void main(String[] args) throws...

If I'm not mistaken, the weights are in the wrong order.

Hi, I noticed that the code is not keeping up with its Java counterpart. Are you still maintaining it?

Just saying hi. This is very nice work indeed. The application of your work at http://dev.microprediction.org/crawling.html may be more than obvious as this is essentially an online CDF estimation contest...

I ran the tests below and found out that on PyPy tdigest is horribly slow. ```python # -*- coding: utf-8 -*- from __future__ import print_function import sys from tdigest import...

It seems like half the time just the one test, test_uniform, fails, and other times it's fine. This issue is just to track that problem with test_uniform. We should figure...

It would be really nice if tags were used in this project so that one could easily see which commit the current version 0.4.1.0 on PyPI actually refers to...

Hi, I believe it would be nice and useful to also add a conda distribution. (I can help with that if needed.)