
Drop Hadoop 2 input stream reflection from parquet-hadoop

Open asfimport opened this issue 3 years ago • 0 comments

Parquet uses reflection to load a Hadoop 2 input stream, falling back to a Hadoop 1 compatible client if it is not found.

All Hadoop 2.0.2+ releases work with H2SeekableInputStream, so the binding to H2SeekableInputStream can be reworked to avoid needing reflection. This would make it a lot easier to probe for and use the ByteBuffer input, and would line the code up for more recent Hadoop releases.

H1SeekableInputStream is still needed to handle streams without ByteBufferReadable.
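A minimal sketch of what reflection-free selection could look like, assuming the choice only depends on whether the wrapped stream implements ByteBufferReadable. To keep it self-contained, the `ByteBufferReadable` interface, the `Binding` enum and the `BufferCapableStream` class below are hypothetical stand-ins, not the real Hadoop or Parquet types; in parquet-hadoop the check would be made against the actual Hadoop interface and would construct H2SeekableInputStream or H1SeekableInputStream.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.ByteBuffer;

final class StreamBindingSketch {

  /** Hypothetical stand-in for Hadoop's ByteBufferReadable interface. */
  interface ByteBufferReadable {
    int read(ByteBuffer buf) throws IOException;
  }

  /** Which wrapper would be chosen; stands in for the H1/H2 stream classes. */
  enum Binding { H1, H2 }

  /** Plain instanceof check: no Class.forName, no reflective constructor lookup. */
  static Binding choose(InputStream wrapped) {
    return (wrapped instanceof ByteBufferReadable) ? Binding.H2 : Binding.H1;
  }

  /** Toy stream that supports ByteBuffer reads, used only to exercise choose(). */
  static final class BufferCapableStream extends ByteArrayInputStream
      implements ByteBufferReadable {

    BufferCapableStream(byte[] data) {
      super(data);
    }

    @Override
    public int read(ByteBuffer buf) {
      if (available() == 0) {
        return -1; // end of stream
      }
      int n = Math.min(buf.remaining(), available());
      byte[] tmp = new byte[n];
      int got = read(tmp, 0, n);
      if (got > 0) {
        buf.put(tmp, 0, got);
      }
      return Math.max(got, 0);
    }
  }

  public static void main(String[] args) {
    // A stream with ByteBuffer support maps to the H2 binding.
    System.out.println(choose(new BufferCapableStream(new byte[] {1, 2, 3})));
    // A plain stream falls back to the H1 binding.
    System.out.println(choose(new ByteArrayInputStream(new byte[] {1, 2, 3})));
  }
}
```

Because the check is an ordinary `instanceof`, the compiler verifies the binding, and adding further capability probes becomes a matter of adding further branches.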

At some point, support for ByteBufferPositionedReadable will be needed, because that is really what Parquet wants. That is where reflection will still be required.
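One way such a reflective probe might look is sketched below. It only checks whether a class can be found on the classpath; the class name `org.apache.hadoop.fs.ByteBufferPositionedReadable` is assumed here to be the interface in question, and `CapabilityProbe` and `isClassAvailable` are hypothetical names introduced for illustration, not part of Parquet or Hadoop.

```java
final class CapabilityProbe {

  /** Assumed fully qualified name of the interface to probe for. */
  static final String POSITIONED_READABLE =
      "org.apache.hadoop.fs.ByteBufferPositionedReadable";

  /** Returns true if the named class can be located, without initializing it. */
  static boolean isClassAvailable(String className) {
    try {
      Class.forName(className, false, CapabilityProbe.class.getClassLoader());
      return true;
    } catch (ClassNotFoundException | LinkageError e) {
      // Absent on older Hadoop versions: the caller should use a fallback path.
      return false;
    }
  }

  public static void main(String[] args) {
    // The result depends on which Hadoop version, if any, is on the classpath.
    System.out.println("positioned ByteBuffer reads available: "
        + isClassAvailable(POSITIONED_READABLE));
  }
}
```

Probing once and caching the result would keep the reflective cost out of the read path; only the newer API needs this treatment, while the ByteBufferReadable check can stay a compile-time binding.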

Reporter: Steve Loughran / @steveloughran

Related issues:

Note: This issue was originally created as PARQUET-2151. Please see the migration documentation for further details.

asfimport, Jun 06 '22 17:06