gazelle_plugin
gazelle_plugin copied to clipboard
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
The queries of TPC-DS with SMJ has obvious performance regression.
**Describe the bug** When running the following sql, the number of result data will increase. When you perform a full join on a table with 10,000 rows of data and...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** In our scenario, users usually read or write hive parquet tables, which...
**Describe the bug** Core Dump(Code:134) with a TypedWholeStageCodeGenImpl::WholeStageCodeGenResultIterator::SetDependencies function fail when running TPC-DS q84 **To Reproduce** This issue can be reproduced in below steps: Under SF500 testing in a single...
**Describe the bug** We meet this problem in a long running application. When the queries, which contains get_json_object in Project or Filter, has been run multiple times, we find the...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** below expression codegen are missing - [ ] lpad - [ ]...
ColumnarBroadcastExchangeExec is redundant to ColumnarBroadcastExchangeExecAdaptor. We can clean up such redundant code.
Signed-off-by: Yuan Zhou ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How was this patch tested? (Please explain how this...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** Currently we use `ValueOrDie` to check the status, such as `LazyBatchIterator.AdvanceTo`, and...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** In WSCG, currently we use batchSize to control the values num in...