asfimport
asfimport
This is to implement the parquet-cli part of the encryption translation tool. It integrates with key tools to build the encryption properties, handle the parameters and call the parquet-hadoop API...
PARQUET-1851 starts abandon to write parquet files with schema (meta information), but with 0 rows, aka empty files. In result it prevent to store empty tables in DRILL by using...
Hello , we just tried latest apache-arrow version 3.0.0 and the write example included in low level api example, but lz4 still seems not compatible with Hadoop . we got...
Use Java NIO SeekableByteChannel for input to reader/writer instead of the current Parquet-only `Output`/`InputFile` Classes **Reporter**: [David Mollitor](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=belugabehr) / @belugabehr #### Related issues: - [Parquet without Hadoop dependencies](https://github.com/apache/parquet-java/issues/2473) (relates to)...
Currently parquet-mr CI does not run `mvn clean`, modules without changes are not recompiled each time. **Reporter**: [Qinghui Xu](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=q.xu) / @qinghui-xu **Assignee**: [Qinghui Xu](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=q.xu) / @qinghui-xu #### Related issues: -...
**Reporter**: [Xinli Shang](https://issues.apache.org/jira/secure/[email protected]) / @shangxinli #### Related issues: - [Mark Parquet-1872 with not support bloom filter yet ](https://github.com/apache/parquet-java/issues/2553) (Blocked) **Note**: *This issue was originally created as [PARQUET-1875](https://issues.apache.org/jira/browse/PARQUET-1875). Please see the...
I hear the need to port the zstd-jni support to 1.10.x because of easiness to use ZSTD. cc `[~dbtsai]` **Reporter**: [Xinli Shang](https://issues.apache.org/jira/secure/[email protected]) / @shangxinli **Assignee**: [Xinli Shang](https://issues.apache.org/jira/secure/[email protected]) / @shangxinli ####...
Here is the error I got: Pyarrow: ``` >>> df = pd.read_parquet("test.parquet", engine="pyarrow") Traceback (most recent call last): File "", line 1, in File "/home/.local/lib/python3.7/site-packages/pandas/io/parquet.py", line 296, in read_parquet return...
Some personal data columns need to be masked instead of being pruned(Parquet-1791). We need a tool to replace the raw data columns with masked value. The masked value could be...
Add a wrapper around Java NIO Path for `org.apache.parquet.io.OutputFile` and `org.apache.parquet.io.InputFile` **Reporter**: [David Mollitor](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=belugabehr) / @belugabehr #### Related issues: - [make it easy to read and write parquet files in...