spark-rapids
spark-rapids copied to clipboard
repartition-based fallback for hash aggregate
this PR closes https://github.com/NVIDIA/spark-rapids/issues/8391.
this PR add a config called spark.rapids.sql.agg.fallbackAlgorithm to let user decide a sort-based algorithm or repartition-based algorithm to use when agg cannot be done in a single pass in memory.
This optimization is orthogonal to https://github.com/NVIDIA/spark-rapids/pull/10950