datafusion
datafusion copied to clipboard
Apache DataFusion SQL Query Engine
Closes #15220 A lot of the work of this PR is meant to resolve https://github.com/apache/datafusion/issues/15220#issuecomment-2727534085. I think I'll move that into a standalone PR.
### Is your feature request related to a problem or challenge? - Part of https://github.com/apache/datafusion/pull/16300/files While testing https://github.com/apache/datafusion/pull/16300, I (re-noticed) that datafusion-cli is really slow when reading remote files For...
## Which issue does this PR close? - Closes https://github.com/apache/datafusion/issues/16054. ## Rationale for this change ## What changes are included in this PR? * Adds a regression test * Always...
### Is your feature request related to a problem or challenge? This is an idea that @robtandy brought up on the DataFusion sync call the other day and I think...
### Is your feature request related to a problem or challenge? https://x.com/andrewlamb1111/status/1925537738360504663 > ClickBench keeps me convinced that Parquet can be quite fast. There is only a 2.3x performance difference...
### Is your feature request related to a problem or challenge? The current Nested Loop Join implementation follows this simplified logic: 1. Buffer the Build Side: All data from the...
### Is your feature request related to a problem or challenge? It's currently not possible to aggregate by `RunArrays`. Example code grouping by a `RunArray` ```rust use arrow::array::{Array, Int32Array, RunArray,...
### Describe the bug 1. make test csv ``` import csv import random import decimal random.seed(42) def make_big_random_decimal(): n = random.randint(1, 1 Result { let ctx = SessionContext::new(); let schema...
## Which issue does this PR close? - Closes #15161. ## Rationale for this change Currently, DataFusion handles comparisons between numbers and string literals differently from a number of databases....
## Which issue does this PR close? - Closes #15596 ## Rationale for this change Explained in issue. Common in other cli programs implementing querying of data. ## What changes...