Gang Wu

304 comments by Gang Wu

Could you please elaborate on the trouble with FLBA + DELTA_BYTE_ARRAY? How can we reproduce it?

Thanks for the information! Have you tried reading it with https://github.com/apache/parquet-java/tree/master/parquet-cli? Unfortunately, I don't have a configured setup for these non-Arrow C++ readers (ClickHouse, DuckDB, Velox). It would...

Thanks. I thought it was a bug in the Parquet writer, but now it is clear that it is a missing feature in those engines.

Sorry for chiming in late. From all the comments here and from https://github.com/apache/parquet-format/pull/240, I think I need to clarify what Parquet can and cannot do with a new geometry type....

> I'd lean towards putting as much of the geoparquet column level metadata as we can into geometry logical type.

Sounds good! But I'd like to mention that parquet logical...

Update: I have simplified the proposal: https://github.com/apache/parquet-format/pull/240. Please let me know what you think. Thanks!

@jorisvandenbossche @paleolimbot I agree that finding a good way to represent mixed geometries natively is a good idea. I think this does not mean we can get away with WKB...

In general, dictionary encoding consumes a lot of memory due to buffering all entries. So yes, `withDictionaryEncoding(false)` is the right approach to reduce the memory footprint in your case. For...
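To illustrate why buffering all entries costs memory, here is a toy sketch in Python. It is not parquet-java's implementation: `dictionary_encode` is a hypothetical helper written only for this illustration, and it ignores page sizes and any size limits a real writer applies. It shows that the dictionary has to hold every distinct value seen so far, so a high-cardinality column buffers nearly as many entries as it has rows.

```python
def dictionary_encode(values):
    """Toy dictionary encoder (illustrative only, not parquet-java code).

    Returns (dictionary, indices), where each index points into dictionary.
    """
    # value -> index; every distinct value stays buffered here until the
    # dictionary is written out, which is where the memory goes.
    index_of = {}
    indices = []
    for value in values:
        # Assign the next free index the first time a value is seen.
        idx = index_of.setdefault(value, len(index_of))
        indices.append(idx)
    # Dicts preserve insertion order, so the keys line up with the indices.
    return list(index_of), indices


# Low cardinality: the dictionary stays tiny no matter how many rows arrive.
low_dict, low_idx = dictionary_encode(["a", "b", "a", "a", "b"])

# High cardinality: every row is distinct, so the dictionary buffers all of them.
high_dict, high_idx = dictionary_encode([f"row-{i}" for i in range(1000)])

print(len(low_dict), len(high_dict))
```

In the second case the buffered dictionary is as large as the data itself, which is the situation where turning dictionary encoding off reduces the footprint.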

@ConeyLiu IIUC, parquet-cli (which uses ParquetAvroReader) might also hit this issue?