[C++][Parquet] Develop external predicate pushdown API for column readers

Open · asfimport opened this issue 9 years ago · 1 comment

This is significantly downstream of where we are right now, but we should plan ahead so that scanning Parquet files with externally defined predicates is supported as a primary use case.

I suggest that the most general (and highest-performance) predicate API will be batch-oriented: the predicate is passed a batch of materialized values from one or more columns and returns an array of booleans indicating, for each row, whether the predicate is true. We can also develop a row-by-row "scalar" predicate API if users need that.
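
To make the proposal concrete, here is a rough sketch of what the two predicate shapes could look like. Everything in it is hypothetical and for illustration only: `ColumnBatch`, `BatchPredicate`, `ScalarPredicate`, `LiftScalar`, and `SelectRows` are made-up names, not existing parquet-cpp types, and the batch is assumed to hold only `int64_t` columns of equal length.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical: materialized values of one column for one batch.
// A real reader would also need nulls, other physical types, etc.
struct ColumnBatch {
  std::vector<int64_t> values;
};

// Batch-oriented predicate: gets one ColumnBatch per column (all of the
// same length) and returns one boolean per row.
using BatchPredicate =
    std::function<std::vector<bool>(const std::vector<ColumnBatch>&)>;

// Row-by-row "scalar" predicate: gets the values of a single row,
// one entry per column.
using ScalarPredicate = std::function<bool(const std::vector<int64_t>&)>;

// Adapts a scalar predicate to the batch interface by evaluating it
// once per row.
BatchPredicate LiftScalar(ScalarPredicate pred) {
  return [pred](const std::vector<ColumnBatch>& columns) {
    const std::size_t num_rows =
        columns.empty() ? 0 : columns[0].values.size();
    std::vector<bool> selected(num_rows);
    std::vector<int64_t> row(columns.size());
    for (std::size_t i = 0; i < num_rows; ++i) {
      for (std::size_t c = 0; c < columns.size(); ++c) {
        // Assumption: every column in the batch has the same row count.
        assert(columns[c].values.size() == num_rows);
        row[c] = columns[c].values[i];
      }
      selected[i] = pred(row);
    }
    return selected;
  };
}

// Evaluates the predicate on a batch and returns the indices of the
// rows for which it is true.
std::vector<std::size_t> SelectRows(const BatchPredicate& pred,
                                    const std::vector<ColumnBatch>& columns) {
  const std::vector<bool> mask = pred(columns);
  std::vector<std::size_t> row_indices;
  for (std::size_t i = 0; i < mask.size(); ++i) {
    if (mask[i]) row_indices.push_back(i);
  }
  return row_indices;
}
```

With this shape, a scalar predicate such as `row[0] < row[1]` can be wrapped with `LiftScalar` and passed anywhere a `BatchPredicate` is expected, so a reader would only need to support the batch form internally.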

Reporter: Wes McKinney / @wesm

Note: This issue was originally created as PARQUET-473. Please see the migration documentation for further details.

asfimport · Jan 27 '16 16:01

This issue hasn't had activity in a long time. If it's still being worked on, please leave a comment. Otherwise, it will be closed on 23rd June.

Labelled Status: Stale-Warning for tracking.

thisisnic · Jun 21 '25 08:06