flow icon indicating copy to clipboard operation
flow copied to clipboard

Parquet Filtering

Open norberttech opened this issue 2 years ago • 0 comments

Parquet comes with very handy mechanism called "Column Statistics" which says for example what are the min/max values, total number of null values etc.

By reading those statistics we won't need to iterate through the entire parquet file when for example we are looking for a data from a specific time range or value range.

norberttech avatar Oct 31 '23 11:10 norberttech