Using aggregation instead of transform to perform `df.groupby().nunique()`

Open qinxuye opened this issue 2 years ago • 0 comments

Currently, `df.groupby().nunique()` is delegated to transform for execution. That makes it a shuffle operation, which is very time-consuming. We can delegate it to aggregation instead, which is much better optimized.
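To illustrate why an aggregation-style plan can work for `nunique`, here is a minimal sketch in plain pandas. It is not Mars's actual implementation: the helper names `map_chunk`, `combine`, and `finalize` are hypothetical, and the two-chunk split only stands in for distributed chunks. The idea is that each chunk can be reduced locally to its distinct (key, value) pairs, so only the deduplicated partials need to be merged before the final count.

```python
import pandas as pd


def map_chunk(chunk, by, col):
    # Hypothetical map stage: keep one row per distinct (key, value) pair
    # found in this chunk, which shrinks the data before it is merged.
    return chunk[[by, col]].drop_duplicates()


def combine(partials):
    # Hypothetical combine stage: merge partial results and deduplicate
    # again, since the same pair may appear in more than one chunk.
    return pd.concat(partials, ignore_index=True).drop_duplicates()


def finalize(combined, by, col):
    # Hypothetical final stage: count distinct values per key.
    return combined.groupby(by)[col].nunique()


df = pd.DataFrame({
    "k": ["a", "a", "b", "b", "a", "b"],
    "v": [1, 1, 2, 3, 2, 3],
})

# Stand-in for distributed chunks (assumption: row-wise partitioning).
chunks = [df.iloc[:3], df.iloc[3:]]

partials = [map_chunk(c, "k", "v") for c in chunks]
result = finalize(combine(partials), "k", "v")

# Reference result computed directly on the full frame.
expected = df.groupby("k")["v"].nunique()
```

In this sketch `result` matches `expected`, and the intermediate data is bounded by the number of distinct (key, value) pairs per chunk rather than by the number of rows.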

qinxuye, Apr 26 '22 07:04