asfimport
asfimport
When merging footers, Parquet doesn't know how to merge conflicting user defined key-value metadata entries, and simply throws. It would be better to provide callbacks to let users define metadata...
When using UnboundRecordFilter with nested AND/OR filters over OPTIONAL columns, there seems to be a case with a mismatch between the current record's column value and the value read during...
The OutputFormat should verify that `parquet.block.size < dfs.blocksize` to avoid bad performance. In addition, we could check that `(dfs.blocksize % parquet.block.size) < 1MB` to ensure that some number of row...
There's currenly no support for conversion to/from Pig datetimes **Reporter**: [Christian Rolf](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=ccrolf) / @ccrolf #### Related issues: - [Sample NanoTime class serializes and deserializes Timestamp incorrectly](https://github.com/apache/parquet-java/issues/1606) (relates to) #### Original...
Vectorized Query Execution could have big performance improvement for SQL engines like Hive, Drill, and Presto. Instead of processing one row at a time, Vectorized Query Execution could streamline operations...
The new filter API seems to be much slower (or perhaps I'm using it wrong \:) Code using an UnboundRecordFilter: ```java ColumnRecordFilter.column(column, ColumnPredicates.applyFunctionToBinary( input -> Binary.fromString(value).equals(input))); ``` vs. code using...
When testing Spark SQL Parquet support, we found that accessing large Parquet files located in S3 can be very slow. To be more specific, we have a S3 Parquet file...
Parquet community [discussed](https://lists.apache.org/thread/sc9zoxd3pd10odtgbrsh5qh7szhl2lhg) and decided ([parquet-\*,](https://lists.apache.org/thread/cxhkft17vx24x1znnvo47hbh6lpqdnrx) [parquet-cpp](https://lists.apache.org/thread/w6r0fwj42wvpm5yflwxhtm2ss3z2hrl3)) to migrate issue tracking from ASF Jira to issue tracker to GitHub issue trackers of individual Apache Parquet repositories with the following component...
**Reporter**: [Fokko Driesprong](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=fokko) / @Fokko **Assignee**: [Fokko Driesprong](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=fokko) / @Fokko **Note**: *This issue was originally created as [PARQUET-2501](https://issues.apache.org/jira/browse/PARQUET-2501). Please see the [migration documentation](https://issues.apache.org/jira/browse/PARQUET-2502) for further details.*
**Reporter**: [Fokko Driesprong](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=fokko) / @Fokko **Assignee**: [Fokko Driesprong](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=fokko) / @Fokko **Note**: *This issue was originally created as [PARQUET-2500](https://issues.apache.org/jira/browse/PARQUET-2500). Please see the [migration documentation](https://issues.apache.org/jira/browse/PARQUET-2502) for further details.*