isarn-sketches-spark icon indicating copy to clipboard operation
isarn-sketches-spark copied to clipboard

Routines and data structures for using isarn-sketches idiomatically in Apache Spark

Results 7 isarn-sketches-spark issues
Sort by recently updated
recently updated
newest added

Great library, but we need an official version for Spark 3.2. Thanks!

Hi, @erikerlandson , @JonathanTaws , I'd like to report a vulnerable dependency in **org.isarnproject:isarn-sketches-spark_2.12:0.5.2-sp3.0**. ### Issue Description I noticed that **org.isarnproject:isarn-sketches-spark_2.12:0.5.2-sp3.0** directly depends on **org.apache.spark:spark-core_2.12:3.0.1** in the [pom](https://repo1.maven.org/maven2/org/isarnproject/isarn-sketches-spark_2.12/0.5.2-sp3.0/isarn-sketches-spark_2.12-0.5.2-sp3.0.pom). However, as...

Kullback-Leibler Divergence Estimation of Continuous Distributions Fernando P ́erez-Cruz Department of Electrical Engineering Princeton University Princeton, New Jersey 08544 Email: [email protected] http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.422.5121&rep=rep1&type=pdf

If I can figure out how to do this, it would make things cleaner, e.g. #8

Easy in scala - have to figure out how to do this in pyspark layer.

I had a request to serialize the Dataframes resulting from TDigest UDAFs directly, to something like parquet or other formats.

Hi. I tried to use `TDigestUDT` in Pyspark and got a class not found error. So I changed the Scala classpath and it worked. Please consider merging this to fix...