Gang Wu

Results 304 comments of Gang Wu

TBH, I'm not sure where the best place to add this is, because it is not precise to say that the statistics of a ColumnChunk are leaf-only. `ColumnChunk` itself is not...

Thanks for reporting this issue! I can confirm that it has been reproduced on my side. Will take a look later.

> I can implement support for this in a branch in DuckDB if that helps get this in. @Mytherin I can help with the PoC implementation with parquet-java and/or parquet-cpp...

@mapleFU Thanks! Then I will focus on parquet-java.

The parquet-java PoC is now live: https://github.com/apache/parquet-java/pull/3191

Thanks @etseidl for the heads-up! I think this PR needs a rebase before proceeding to the next step?

It looks like a breaking change to timezone names in TZDB. I will take a look. cc @ffacs

https://bugs.launchpad.net/ubuntu/+source/tzdata/+bug/2058249 explains the root cause: `tzdata` has intentionally moved timezone files like `US/Pacific` to a separate `tzdata-legacy` package without providing symlinks, so it is a breaking change...
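
A minimal sketch of how one might detect the missing legacy name on a given machine, assuming Python 3.9+ and its standard `zoneinfo` module. The helper names and the one-entry alias table are hypothetical and only for illustration. Whether `US/Pacific` resolves depends on the tz data installed locally.

```python
from zoneinfo import ZoneInfo, ZoneInfoNotFoundError

# Hypothetical alias table for illustration; only one well-known pair is listed.
LEGACY_ALIASES = {"US/Pacific": "America/Los_Angeles"}


def timezone_available(name: str) -> bool:
    """Return True if the local tz database can resolve `name`."""
    try:
        ZoneInfo(name)
        return True
    except (ZoneInfoNotFoundError, ValueError):
        # Unknown or malformed key: the zone file is not available here.
        return False


def resolve_timezone(name: str) -> str:
    """Return `name` if it resolves, otherwise a known canonical alias."""
    if timezone_available(name):
        return name
    # Fall back to the alias if one is listed; otherwise return the name unchanged.
    return LEGACY_ALIASES.get(name, name)


if __name__ == "__main__":
    print(resolve_timezone("US/Pacific"))
```

On a system without the legacy files, `resolve_timezone("US/Pacific")` would return `America/Los_Angeles`; where the legacy name is still present, it is returned unchanged.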

@bdice I think we can keep those files as they are created by legacy writers: ` "format": "0.12", "writer version": "HIVE-8732", "software version": "ORC Java"`. We can use the latest...

I have only found this: https://github.com/apache/parquet-testing/blob/master/data/old_list_structure.md