Gang Wu
Gang Wu
TBH, I'm not sure where is the best place to add this because it is not precise to say that statistics of a ColumnChunk is leaf-only. `ColumnChunk` itself is not...
Thanks for reporting this issue! I can confirm that it has been reproduced on my side. Will take a look later.
> I can implement support for this in a branch in DuckDB if that helps get this in. @Mytherin I can help with the PoC implementation with parquet-java and/or parquet-cpp...
@mapleFU Thanks! Then I will focus on parquet-java.
PoC from parquet-java is now alive: https://github.com/apache/parquet-java/pull/3191
Thanks @etseidl for the heads-up! I think this PR needs a rebase before proceeding to the next step?
It looks like a breaking change of timezone name from TZDB. I will take a look. cc @ffacs
https://bugs.launchpad.net/ubuntu/+source/tzdata/+bug/2058249 has explained the root cause that `tzdata` has moved timezone files like `US/Pacific` to a separate `tzdata-legacy` library without providing symlinks by intention so it is a breaking change...
@bdice I think we can keep those files are they are created by legacy writers: ` "format": "0.12", "writer version": "HIVE-8732", "software version": "ORC Java"`. We can use the latest...
I have only found this: https://github.com/apache/parquet-testing/blob/master/data/old_list_structure.md