[EPIC] Improve performance of TPC-DS queries
What is the problem the feature request solves?
This issue tracks work to improve Comet's performance when running benchmarks derived from TPC-DS.
- [ ] Add support for DPP (Dynamic Partition Pruning) - requires https://github.com/apache/datafusion-comet/issues/121
- [ ] https://github.com/apache/datafusion-comet/issues/1006
- [ ] https://github.com/apache/datafusion-comet/issues/846
- [ ] https://github.com/apache/datafusion-comet/issues/398
- [ ] https://github.com/apache/datafusion-comet/issues/458
- [x] Add support for date_add - https://github.com/apache/datafusion-comet/pull/910
- [ ] stddev_sample disabled by default due to performance issue - https://github.com/apache/datafusion-comet/issues/824
- [ ] concat is not supported
- [ ] Unsupported result expressions found in: Vector(MakeDecimal(sum(UnscaledValue(sales_price#3737))#3873L,17,2) AS sales#3747, ...
- [ ] unsupported Spark data type: StructType(StructField(count(1),LongType,false),StructField(avg(ss_ext_discount_amt),DecimalType(11,6),true),StructField(avg(ss_net_profit),DecimalType(11,6),true))
- [ ] Window not supported natively but no reason given
Describe the potential solution
No response
Additional context
No response