spark-notes
Deep Dive into Apache Spark: an in-depth study of the Spark source code
This series of Spark study notes dives deep into the Apache Spark source code, so it covers advanced topics and assumes readers are already familiar with the Spark APIs.
Table of contents
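- Setting up a development environment for reading and debugging the Spark source code
- Understanding the Spark RDD abstraction in depth and writing RDD API functions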
Roadmap
In progress
- Execution Model and Job Scheduling
Next
- Shuffle
- Spark SQL
- Dataset
- Catalyst Optimizer
- Spark Streaming
- and so on