datafusion-comet
datafusion-comet copied to clipboard
Improve performance of aggregate expressions by implementing group accumulators
What is the problem the feature request solves?
DataFusion has a fast path for aggregates that implement group accumulators.
See https://github.com/apache/datafusion-comet/issues/1275#issuecomment-2787795808 for one example where we saw a 4x speedup.
We should apply this change to the following Comet aggregates.
- stddev
- correlation
- covariance
- variance
Describe the potential solution
No response
Additional context
No response
I would like implement group accumulators variance