parquet-java icon indicating copy to clipboard operation
parquet-java copied to clipboard

PARQUET-198: Initial commit of ParquetAvroScheme (WIP)

Open AndreSchumacher opened this issue 9 years ago • 1 comments

This is a first attempt of adding a ParquetAvroScheme to parquet-cascading that can be used to read and write Avro records from Parquet files (say, from within Scalding). The code draws heavily on the Thrift implementation of the respective classes.

Currently only code-generated Avro classes are supported.

This code is also partly based on the work done here: https://github.com/epishkin/scalding/tree/parquet_avro/scalding-parquet

AndreSchumacher avatar Mar 23 '15 15:03 AndreSchumacher

Any chance this will eventually make it into the codebase? I could really use it.

dkincaid avatar May 26 '16 23:05 dkincaid