datafusion-comet icon indicating copy to clipboard operation
datafusion-comet copied to clipboard

[EPIC] Improve performance of TPC-DS queries

Open andygrove opened this issue 1 year ago • 0 comments

What is the problem the feature request solves?

This issue is for tracking issues to perform Comet's performance when running benchmarks derived from TPC-DS.

  • [ ] Add support for DPP (Dynamic Partition Pruning) - requires https://github.com/apache/datafusion-comet/issues/121
  • [ ] https://github.com/apache/datafusion-comet/issues/1006
  • [ ] https://github.com/apache/datafusion-comet/issues/846
  • [ ] https://github.com/apache/datafusion-comet/issues/398
  • [ ] https://github.com/apache/datafusion-comet/issues/458
  • [x] Add support for date_add - https://github.com/apache/datafusion-comet/pull/910
  • [ ] stddev_sample disabled by default due to performance issue - https://github.com/apache/datafusion-comet/issues/824
  • [ ] concat is not supported
  • [ ] Unsupported result expressions found in: Vector(MakeDecimal(sum(UnscaledValue(sales_price#3737))#3873L,17,2) AS sales#3747, ...
  • [ ] unsupported Spark data type: StructType(StructField(count(1),LongType,false),StructField(avg(ss_ext_discount_amt),DecimalType(11,6),true),StructField(avg(ss_net_profit),DecimalType(11,6),true))
  • [ ] Window not supported natively but no reason given

Describe the potential solution

No response

Additional context

No response

andygrove avatar Aug 21 '24 15:08 andygrove