datafusion
Apache DataFusion SQL Query Engine
I think it would be super valuable to write a blog post about all the work from @thinkharderdev @Ted-Jiang, @tustvold and others to make reading from parquet in DataFusion very...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** We implement user-defined aggregates in IOx called "selector functions" -- to...
In https://github.com/apache/arrow-datafusion/pull/3380 @thinkharderdev added support for evaluating filters during the parquet scan via the RowFilter mechanism 🎉 This feature is currently enabled via a feature flag, which is disabled by...
**Describe the bug** Row Hash aggregation loads the whole aggregation state into memory before sending a single batch downstream. The resulting record batch will have more rows than the predefined batch size...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** A clear and concise description of what the problem is. Ex. I'm...
# Which issue does this PR close? N/A # Rationale for this change It is getting challenging to create PRs that depend on recent changes in dependencies such as `arrow`...
Adds a specific note on the lowercase treatment of columns in SQL # Which issue does this PR close? Closes #2374 # Rationale for this change This example is the first thing...
**Describe the bug** A clear and concise description of what the bug is. When changing the TPC-H tests to use Decimals, getting panics with this error: ``` thread 'sql::subqueries::tpch_q17_correlated' panicked...