An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
An open protocol for secure data sharing
Study notes for "Big Data Analysis with Scala and Spark" on Coursera
Data Science: Scala for brave and impatient
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
🔎 .NET Core cross-platform, in-memory, full text search library for building search engines. Made to learn C#.