parquet-java icon indicating copy to clipboard operation
parquet-java copied to clipboard

Use airlift non-native implementations for GZIP, LZ0 and LZ4 codecs

Open asfimport opened this issue 6 years ago • 1 comments

@rdblue  pointed me to https://github.com/airlift/aircompressor which provides non-native implementations of compression codecs. It claims to be much faster than native wrappers that parquet uses. This Jira is to track the work needed for exploring using these codecs, getting benchmark results and making changes including not needing to pool compressors and decompressors anymore. Note that this doesn't include SNAPPY since Parquet already has its own non-hadoopy implementation for it. 

Reporter: Samarth Jain / @samarthjain Assignee: Samarth Jain / @samarthjain

PRs and other links:

Note: This issue was originally created as PARQUET-1643. Please see the migration documentation for further details.

asfimport avatar Aug 28 '19 18:08 asfimport

Which "non-hadoopy implementation" is this referring to? Is this referring to SnappyDecompressor in org.apache.parquet.hadoop.codec?

zhiyun-deng avatar Oct 03 '24 07:10 zhiyun-deng