datafusion
Apache DataFusion SQL Query Engine
## Which issue does this PR close? Closes #10589 ## Rationale for this change Provide per-column key/value options in the `CREATE EXTERNAL TABLE` statement. ## What changes are included in...
### Describe the bug I'm running the following query: ```sql select span_name from records order by bit_length(attributes) desc limit 20 ``` And it's running out of memory with 20GB memory...
## Which issue does this PR close? related to https://github.com/apache/datafusion/issues/10616 ## Rationale for this change While working on https://github.com/apache/datafusion/pull/10549 I kept getting confused about what an `ObjectStoreUrl` was / is....
## Which issue does this PR close? Tests for https://github.com/apache/arrow-datafusion/issues/8155 ## Rationale for this change I have found that statistics were lost while being propagated upward in the plan. These are...
## Which issue does this PR close? Closes https://github.com/apache/arrow-datafusion/issues/9090. ## What changes are included in this PR? **This is still work in progress.** The PR introduces a few simple optimizations...
## Which issue does this PR close? Closes #10599 ## Rationale for this change ## What changes are included in this PR? ## Are these changes tested? Not yet, but...
### Describe the bug While working on https://github.com/apache/datafusion-comet/pull/437, a few Spark join tests failed when delegating to DataFusion HashJoin. This is because the DataFusion HashJoin LeftAnti Join returns incorrect...
## Which issue does this PR close? Closes #10583. ## Rationale for this change ## What changes are included in this PR? ## Are these changes tested? ## Are there...
### Describe the bug I found this bug while adding tests for reading parquet statistics https://github.com/apache/datafusion/pull/10592/. Instead of getting corresponding UInt8Array, UInt16Array, UInt32Array for columns with u8, u16, u32 data...
### Describe the bug I found this while adding tests for reading parquet statistics https://github.com/apache/datafusion/pull/10592. Instead of getting back `BinaryArray`, we get `StringArray` ### To Reproduce See test `test_byte` in...