spark-sql topic

List spark-sql repositories

pyspark-cheatsheet

355
Stars
120
Forks

🐍 Quick reference guide to common patterns & functions in PySpark.

Movies-Analytics-in-Spark-and-Scala

90
Stars
52
Forks

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

airbnb-spark-thrift

43
Stars
16
Forks

A library for loading Thrift data into Spark SQL

Spark-Structured-Streaming-Examples

182
Stars
79
Forks

Spark Structured Streaming / Kafka / Cassandra / Elastic

redash

25.6k
Stars
4.3k
Forks
573
Watchers

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

spark

2.0k
Stars
310
Forks

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

LearningSparkV2

1.1k
Stars
690
Forks

This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

data-accelerator

295
Stars
89
Forks

Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience to help with creation, editing, and management of Spark jobs on Azure HDInsigh...

bdp

191
Stars
139
Forks

A prototype big data platform, with the source code for the book Big Data Platform Architecture and Prototype

bigdata-playground

205
Stars
73
Forks

A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apach...