spark-avro
spark-avro copied to clipboard
writing avro data in parquet format
Hi there, While there is a nice way to save an avro schema in a parquet file when working with RDD's, I've been unable to find something similar for DataFrames. Are there any plans to add this feature to the spark-avro project?
@liancheng, do you know whether Spark's Parquet data source supports this? Would this even be possible in this library or is this request inherently out of scope w.r.t. this library's APIs?