spark-recommender
spark-recommender copied to clipboard
Avoid using groupByKey
Instead using .groupByKey()
we should use .reduceByKey(_ + _)
, to avoid shuffling.
It is used e.g. here:
Reference: prefer_reduceby