big-data topic

List big-data repositories

delta

6.9k Stars, 1.6k Forks, 208 Watchers

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

delta-sharing

682 Stars, 147 Forks

An open protocol for secure data sharing

coursera-spark-notes

12 Stars, 7 Forks

Study notes for "Big Data Analysis with Scala and Spark" on Coursera

learning-scala-for-data-science

6 Stars, 2 Forks

Data Science: Scala for brave and impatient

big-data-study

148 Stars, 50 Forks

🐳 big data study

data-science-ipython-notebooks

26.5k Stars, 7.7k Forks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...

Movies-Analytics-in-Spark-and-Scala

87 Stars, 51 Forks

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

javaer-mind

66 Stars, 40 Forks

Mind maps for Java programmers' advanced learning

coresearch

39 Stars, 25 Forks

🔎 .NET Core cross-platform, in-memory, full text search library for building search engines. Made to learn C#.

lcbo-api

182 Stars, 43 Forks

A crawler and API server for Liquor Control Board of Ontario retail data