asfimport
asfimport
[Kai Liu](https://issues.apache.org/jira/browse/PARQUET-1155?#comment-16992782): Hi, Machiel, Did you get any attraction on this issue? And would you mind share the approach you are taking to address GDPR requirement in your system now?...
[Machiel Groeneveld](https://issues.apache.org/jira/browse/PARQUET-1155?#comment-16997093): Hi `[~zjumad]` there is no news from the Parquet side. Though a recent development in the community to deal with this problem is Delta Lake. They add a...
[Fokko Driesprong](https://issues.apache.org/jira/browse/PARQUET-1155?#comment-17001677) / @Fokko: I don't see this being implemented in Apache Parquet. However, Delta lake, and also Apache Iceberg can solve the issue that you're describing. With Delta, make...
[Raphael Taylor-Davies](https://issues.apache.org/jira/browse/PARQUET-2162?#comment-17559109) / @tustvold: Possibly related **https://issues.apache.org/jira/browse/PARQUET-2139**
[Gang Wu](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691009) / @wgtmac: When a page only contains NaN, the page statistics do not set min_value and max_value. However, it is not a null page. IMHO, the ColumnIndex of...
[Jan Finis](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691108) / @jfinis: @wgtmac True, not writing a column index in this case is also a solution. Note though that this is a pessimization for pages not containing NaN...
[Xuwei Fu](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691179) / @mapleFU: The problem seem to be that, float point is so widely used, but they are "partial order". Seems that iceberg provides NaN counts. And min-max is...
[Jan Finis](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691184) / @jfinis: I would be willing to propose a fixing commit for this, but I'm not part of ASF and the whole process, yet, so I don't know...
[Gang Wu](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691199) / @wgtmac: As of today, there are many different parquet implementations (java, cpp, rust, etc). Directly making a PR to parquet-format repo is a good start to reach...
[Xuwei Fu](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691256) / @mapleFU: I guess NaN is not always larger than all values. 1. Postgres, DB2 and Oracle put NULL higher than any other values 1. MSSQL and MySQL...