Ivan G
I'm going to support all the compression methods in v4, but I can't give an estimate of when it will be public.
I've got all compression formats supported in v4, though it's far from release: https://github.com/aloneguid/parquet-dotnet/blob/delta/docs/compression.md
Now that v4/async is out, compression codecs should come soon.
All compression methods are supported in [4.1.0](https://github.com/aloneguid/parquet-dotnet/releases/tag/4.1.0)
I think it looks good, thanks. I can merge if you are happy.
Closing due to no activity.
I think you can do that in theory, but writing in chunks will effectively discard any logical compression. Could you use a smaller row group size and array pooling instead?
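To illustrate why writing in small chunks hurts compression, here is a minimal sketch in Python. It is only an analogy: it uses `zlib` as a stand-in general-purpose compressor, not parquet-dotnet's API or Parquet's own encodings, and the column data, names, and chunk sizes are made up for the example. The assumption is that each chunk is compressed independently, so repetition that spans chunks cannot be exploited.

```python
import zlib

# Hypothetical low-cardinality column: 100,000 values drawn from 50 distinct strings.
rows = [f"category-{i % 50}".encode() for i in range(100_000)]


def compressed_size(values):
    """Size in bytes of the values compressed together as one block."""
    return len(zlib.compress(b"\n".join(values)))


def chunked_size(values, chunk_rows):
    """Total size when each chunk of `chunk_rows` values is compressed on its own."""
    return sum(
        compressed_size(values[start:start + chunk_rows])
        for start in range(0, len(values), chunk_rows)
    )


whole = compressed_size(rows)
large_chunks = chunked_size(rows, 10_000)
small_chunks = chunked_size(rows, 100)

print(f"one block:         {whole} bytes")
print(f"10,000-row chunks: {large_chunks} bytes")
print(f"100-row chunks:    {small_chunks} bytes")
```

The smaller the independently compressed chunk, the larger the total output, which is the trade-off to weigh when shrinking the row group size to save memory.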
Thanks @hrabeale, please go ahead, all PRs are welcome :) I'll review on Monday.
5000 rows is the general Parquet recommendation from the original Java implementation, but that was back in the Hadoop days of not-so-big data. These days I'd think you would tune it to a reasonable amount...