Gang Wu
Gang Wu
Thanks @paleolimbot for the detail feedback! I have changed the name and removed controversial items. I will start a PoC implementation in parquet-java later this week if there is no...
FYI: Parquet community is working together with GeoParquet and Iceberg community to propose a new geometry logical type: https://github.com/apache/parquet-format/pull/240
Thanks @zxf216! Let me take a first pass. cc @coderex2522 @ffacs
I'll try to update the site for parquet-cpp and parquet-java later this month. Update: created https://github.com/apache/parquet-site/pull/100
Could you provide more context? How did you tell the C++ reader to read only row 100 and 130 out of an ORC file?
Thanks @ffacs and @dongjoon-hyun! I just merged it
Thanks for your interest in contributing this! This seems to be a large feature and the performance gain is promising! However, I'm afraid that this PR may not get prompt...
Thanks for creating an issue wrt to a new compression codec. There was a similar discussion happened on [email protected]: https://lists.apache.org/thread/ht95wm8trfx2z4pq91t7170t2qjqg4yw. I think the replies have provided some general concerns of...
It seems that we can only control dictionary encoding and byte stream split encoding via ParquetProperties: https://github.com/apache/parquet-java/blob/master/parquet-column/src/main/java/org/apache/parquet/column/ParquetProperties.java. Other encoding types are enabled via WriterVersion: https://github.com/apache/parquet-java/blob/master/parquet-column/src/main/java/org/apache/parquet/column/values/factory/DefaultValuesWriterFactory.java
Sure, let me try it on my own.