data-engineering-zoomcamp icon indicating copy to clipboard operation
data-engineering-zoomcamp copied to clipboard

Timecodes for "DE Zoomcamp 5.4.2 - GroupBy in Spark"

Open alexeygrigorev opened this issue 2 years ago • 1 comments

Youtube video: https://www.youtube.com/watch?v=9qrDsY_2COo

alexeygrigorev avatar May 20 '22 05:05 alexeygrigorev

0:00:00 - Spark group by query explained. 0:02:05 - Data analysis and filtering process. 0:04:18 - Order by, group by explanation. 0:06:19 - Combining subresults for group by. 0:08:10 - Reshuffling: Partitioning and Sorting Algorithm. 0:10:13 - Reshuffling, combining, ordering, filtering, repartitioning. 0:12:12 - Shuffling data for optimization.

dimzachar avatar Sep 09 '23 16:09 dimzachar

Updated, thank you!

amitfrancis avatar Jan 15 '24 11:01 amitfrancis