spark-notes icon indicating copy to clipboard operation
spark-notes copied to clipboard

Deep Dive into Apache Spark 深入研读Spark源码

Deep Dive into Apache Spark

These series of spark study notes will deep dive into Apache Spark source codes, which means there will be advanced topics and readers need to be alreadly familiar with Spark APIs.

Table of contents

  1. 搭建Spark源码研读和代码调试的开发环境
  2. 深入理解Spark RDD抽象模型和编写RDD API函数

Roadmap

In progress

  • Execution Model and Job Scheduling

Next

  • Shuffle
  • Spark SQL
  • DataSet
  • Catalyst Optimizer
  • Spark Streaming
  • and so on