Gang Wu
Gang Wu
I think it is fine to use the same properties across all row groups.
Please ping me when ready for review again. Thanks!
@diego-ciciani01 Feel free to create a PR :)
I think this is allowed from the spec: `A column chunk might be partly or completely dictionary encoded`. I'm not sure if any open source Parquet writer is able to...
I think it makes sense to add this. Please feel free to create a PR.
@steveloughran I have the same feeling. For the reader we already have `ParquetReadOptions`. The writer does not even have this parity.
BTW, could you also add a test case for it?
``` Error: Failed to execute goal com.diffplug.spotless:spotless-maven-plugin:2.30.0:check (default) on project parquet-hadoop: The following files had format violations: Error: src/main/java/org/apache/parquet/hadoop/ParquetFileReader.java Error: @@ -739,7 +739,8 @@ Error: ···*·@return·an·open·ParquetFileReader Error: ···*·@throws·IOException·if·there·is·an·error·while·opening·the·file Error: ···*/...
> Implementations must not write values that are larger than the annotation allows I agree that this should be fixed: https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#unsigned-integers
Could you try to rebase it? I think the CI failure is due to based on a problematic commit.