
Incorrect statistics read for binary columns in parquet

Open • NGA-TRAN opened this issue 9 months ago • 0 comments

Describe the bug

I found this while adding tests for reading Parquet statistics in https://github.com/apache/datafusion/pull/10592. For binary columns, the statistics come back as a StringArray instead of a BinaryArray.

To Reproduce

See the test test_byte in PR https://github.com/apache/datafusion/pull/10592

Expected behavior

The min and max statistics for binary columns are returned as a BinaryArray.
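
One reason the distinction matters: binary values are arbitrary bytes and need not be valid UTF-8, so a string array cannot always represent them faithfully. Below is a minimal sketch in plain Rust using only the standard library. The helper name stat_as_utf8 is hypothetical and is not part of DataFusion or arrow; it only illustrates the general point and is not the code path from the PR.

```rust
/// Hypothetical helper (not a DataFusion or arrow API): try to interpret
/// the raw bytes of a min/max statistic as UTF-8 text.
/// Returns `None` when the bytes are not valid UTF-8.
fn stat_as_utf8(raw: &[u8]) -> Option<&str> {
    std::str::from_utf8(raw).ok()
}
```

Under this sketch, a statistic such as b"abc" converts cleanly, while bytes like [0xff, 0xfe, 0x00] do not, which is why a binary column's statistics are better kept in a binary type.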

Additional context

No response

NGA-TRAN, May 21 '24 16:05