spark-ml topic
dlsa
Distributed least squares approximation (dlsa) implemented with Apache Spark
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Spark-Scala-EKS
Spark Scala docker container sample for AWS testing - EKS & S3
pre-lt-raster-frames
Spark DataFrames for earth observation data
YelpDatasetChallenge
Restaurant recommendations and review text-based quality predictions
pySpark_tutorial
Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning
fdp-modelserver
An umbrella project for multiple implementations of model serving