big-data topic

List big-data repositories

delta

5.5k
Stars
1.2k
Forks
208
Watchers

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

delta-sharing

488
Stars
97
Forks
Watchers

An open protocol for secure data sharing

coursera-spark-notes

12
Stars
7
Forks
Watchers

Study notes for "Big Data Analysis with Scala and Spark" on Coursera

learning-scala-for-data-science

6
Stars
2
Forks
Watchers

Data Science: Scala for brave and impatient

big-data-study

145
Stars
49
Forks
Watchers

:whale: big data study

data-science-ipython-notebooks

24.4k
Stars
7.4k
Forks
Watchers

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...

Movies-Analytics-in-Spark-and-Scala

58
Stars
43
Forks
Watchers

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

javaer-mind

66
Stars
38
Forks
Watchers

Java 程序员进阶学习的思维导图

coresearch

30
Stars
9
Forks
Watchers

🔎 .NET Core cross-platform, in-memory, full text search library for building search engines. Made to learn C#.

lcbo-api

153
Stars
40
Forks
Watchers

A crawler and API server for Liquor Control Board of Ontario retail data