Ted Malaska
Ted Malaska
CopybookInputFormat
Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...
HBase-ToHDFS
Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet
Spark.TableStatsExample
Simple Spark example of generating table stats for use of data quality checks
SparkOnALog
Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems
SparkOnKudu
Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.
SparkStreaming.Sessionization
NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase
SparkUnitTestingExamples
This project is a collection of Spark Unit Tests Examples to help new Spark users have good examples on how to unit start their code for Spark Core, Spark SQL, and Spark Streaming