parquet-go icon indicating copy to clipboard operation
parquet-go copied to clipboard

parquet-tool - file statistics command

Open panamafrancis opened this issue 2 years ago • 1 comments

As a user i would like a command to aid in debugging parquet files. For instance I would like to obtain the following file stats in a single command:

  • compression algorithm
  • page type v1/v2?
  • row group size
  • author / created by
  • version
  • metadata
  • page size
  • total records /row count
  • any internal info that could help too

panamafrancis avatar Mar 04 '22 09:03 panamafrancis

re. https://github.com/fraugster/parquet-go/issues/85 how can we provide an estimate of the uncompressed size?

panamafrancis avatar Apr 19 '22 16:04 panamafrancis