Gang Wu
Gang Wu
@rok @gszadovszky @Fokko @julienledem Would you mind taking a look?
Thanks for the suggestion! I've simplified the wording and removed unimportant comments. PTAL. @Fokko @gszadovszky
@Fokko Gentle ping :)
Thanks @Fokko for merging!
Thanks @paleolimbot for the quick review! I've addressed your feedback and also added page index support for geometry stats. BTW, it is a convention that we need PoC on another...
Thanks @jiayuasu for your valuable feedback! Yes, it would be great if we can get direct help from geospatial experts!
Close this in favor of https://github.com/apache/parquet-java/pull/2971
``` Error: Failed to execute goal com.diffplug.spotless:spotless-maven-plugin:2.30.0:check (default) on project parquet-hadoop: The following files had format violations: Error: src/test/java/org/apache/parquet/format/converter/TestParquetMetadataConverter.java Error: @@ -1277,9 +1277,8 @@ Error: ····return·createParquetMetaData(dicEncoding,·dataEncoding,·true); Error: ··} Error: Error:...
Do you have any concrete suggestion on what value to provide? My concern is that changing the behavior may affect a lot of downstream applications in the wild without notice.
ColumnChunk statistics is always for the leaf columns. I'm not familiar with DataFrame. The data in your example contains two lists: `[1, 2, 3]` and `null`. `null_count` includes null values...