mapreduce topic
Coursera_Bigdata_UCSD
UCSD Big Data Specialization General Materials and my Capstone Project.
Coursera-UW-Machine-Learning-Clustering-Retrieval
gomap
Run your MapReduce workloads as a single binary on a single machine with multiple CPUs and high memory. On most cloud providers, many small machines cost about the same as one large machine of equivalent capacity.
mit-6.824-distributed-systems
Template repository for working on the labs from the MIT 6.824 Distributed Systems course.
interview-refresher-java-bigdata
A one-stop repo for looking up code snippets on core Java concepts, SQL, data structures, and big data. It also includes questions asked in real interviews.
market-basket-analysis
Hadoop MapReduce implementation of Market Basket Analysis for frequent itemset and association rule mining using the Apriori algorithm.
dijkstra-hadoop-spark
Dijkstra's algorithm implemented with Python Hadoop Streaming and PySpark.
lectures-hse-spark
Scalable machine learning and big data analysis with Apache Spark