pinot Query Cost Model and Usage

Open atris opened this issue 2 years ago • 2 comments

This issue tracks building a mechanism to identify and compute the indicative cost of a query, based on parameters such as query complexity and data statistics (cardinality, selectivity, etc.).

The cost model will be pluggable -- you just need to implement the interfaces.
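To make "pluggable" concrete, here is a minimal sketch of what such an interface could look like. Every name in it (`QueryProfile`, `QueryCostModel`, `CostModelRegistry`, the two implementations, and the weights 0.25 and 0.5) is a hypothetical illustration, not an existing Pinot API or the design proposed in this issue.

```java
import java.util.HashMap;
import java.util.Map;

// All names below are hypothetical illustrations, not existing Pinot classes.

/** Inputs a cost model might look at; the fields are assumed for this sketch. */
final class QueryProfile {
    final long estimatedRowsScanned;
    final int numFilterPredicates;
    final int numGroupByColumns;

    QueryProfile(long estimatedRowsScanned, int numFilterPredicates, int numGroupByColumns) {
        this.estimatedRowsScanned = estimatedRowsScanned;
        this.numFilterPredicates = numFilterPredicates;
        this.numGroupByColumns = numGroupByColumns;
    }
}

/** The plug-in point: map a query profile to a unitless, indicative cost. */
interface QueryCostModel {
    double estimateCost(QueryProfile query);
}

/** Simplest model: cost equals the estimated number of rows scanned. */
final class RowCountCostModel implements QueryCostModel {
    @Override
    public double estimateCost(QueryProfile query) {
        return query.estimatedRowsScanned;
    }
}

/** Alternative model: cost also grows with predicates and group-by columns. */
final class ShapeAwareCostModel implements QueryCostModel {
    @Override
    public double estimateCost(QueryProfile query) {
        // The weights 0.25 and 0.5 are arbitrary values chosen for illustration.
        double shapeFactor = 1.0 + 0.25 * query.numFilterPredicates + 0.5 * query.numGroupByColumns;
        return query.estimatedRowsScanned * shapeFactor;
    }
}

/** Resolves the model named in configuration to an implementation. */
final class CostModelRegistry {
    private final Map<String, QueryCostModel> models = new HashMap<>();

    void register(String name, QueryCostModel model) {
        models.put(name, model);
    }

    QueryCostModel lookup(String name) {
        QueryCostModel model = models.get(name);
        if (model == null) {
            throw new IllegalArgumentException("Unknown cost model: " + name);
        }
        return model;
    }
}

final class CostModelDemo {
    public static void main(String[] args) {
        CostModelRegistry registry = new CostModelRegistry();
        registry.register("rowCount", new RowCountCostModel());
        registry.register("shapeAware", new ShapeAwareCostModel());

        QueryProfile profile = new QueryProfile(1000L, 2, 1);
        System.out.println(registry.lookup("rowCount").estimateCost(profile));   // prints 1000.0
        System.out.println(registry.lookup("shapeAware").estimateCost(profile)); // prints 2000.0
    }
}
```

Swapping models then only means registering a different implementation under the configured name; callers depend on `QueryCostModel` alone.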

The cost can be used for things like throttling and cancelling expensive queries when resources are constrained.
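As a rough sketch of how a cost could drive such decisions, the example below compares a query's cost against two thresholds that shrink as resource utilisation rises. The class `CostBasedAdmission`, the enum `AdmissionDecision`, the thresholds, and the linear scaling rule are all assumptions made for illustration; nothing here is an existing Pinot API.

```java
// Hypothetical names and policy; this is not an existing Pinot API.

enum AdmissionDecision { ADMIT, THROTTLE, CANCEL }

final class CostBasedAdmission {
    private final double throttleAbove; // cost above which a query is throttled on an idle system
    private final double cancelAbove;   // cost above which a query is cancelled on an idle system

    CostBasedAdmission(double throttleAbove, double cancelAbove) {
        if (throttleAbove > cancelAbove) {
            throw new IllegalArgumentException("throttleAbove must not exceed cancelAbove");
        }
        this.throttleAbove = throttleAbove;
        this.cancelAbove = cancelAbove;
    }

    /**
     * @param queryCost   indicative cost produced by some cost model
     * @param utilisation current resource utilisation in [0, 1]
     */
    AdmissionDecision decide(double queryCost, double utilisation) {
        // Scale both thresholds by the remaining headroom, so that a busier
        // system tolerates only cheaper queries (assumed linear policy).
        double headroom = Math.min(1.0, Math.max(0.0, 1.0 - utilisation));
        if (queryCost > cancelAbove * headroom) {
            return AdmissionDecision.CANCEL;
        }
        if (queryCost > throttleAbove * headroom) {
            return AdmissionDecision.THROTTLE;
        }
        return AdmissionDecision.ADMIT;
    }
}

final class AdmissionDemo {
    public static void main(String[] args) {
        CostBasedAdmission admission = new CostBasedAdmission(100.0, 1000.0);
        System.out.println(admission.decide(80.0, 0.0));  // prints ADMIT
        System.out.println(admission.decide(80.0, 0.5));  // prints THROTTLE
        System.out.println(admission.decide(80.0, 0.95)); // prints CANCEL
    }
}
```

The same query moves from admitted to throttled to cancelled as utilisation grows, which is the behaviour described above. A real policy would probably also need to handle queries that are already running.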

At a later stage, we can explore using the cost for a more intelligent query optimiser.

Note that this is more of a "logical" cost -- using statistics and query cyclomatic complexity.
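One way to picture a "logical" cost is the sketch below. It combines textbook selectivity estimates (uniform value distribution for equality predicates, independence between ANDed predicates) with a cyclomatic-style complexity count. The class `LogicalCost`, its method names, and the final formula (rows × selectivity × complexity) are assumptions for illustration only; the issue does not specify how the two inputs would be combined.

```java
// Hypothetical helper; the formula is an illustrative assumption, not the proposed model.

final class LogicalCost {
    private LogicalCost() {}

    /** Selectivity of `column = value`, assuming values are uniformly distributed. */
    static double equalitySelectivity(long cardinality) {
        if (cardinality <= 0) {
            throw new IllegalArgumentException("cardinality must be positive");
        }
        return 1.0 / cardinality;
    }

    /** Selectivity of predicates joined by AND, assuming they are independent. */
    static double conjunctionSelectivity(double... selectivities) {
        double combined = 1.0;
        for (double selectivity : selectivities) {
            combined *= selectivity;
        }
        return combined;
    }

    /** Cyclomatic-style complexity: one plus the number of decision points (here, predicates). */
    static int complexity(int numPredicates) {
        return 1 + numPredicates;
    }

    /** Estimated matching rows, weighted by query complexity. */
    static double cost(long totalRows, double selectivity, int complexity) {
        return totalRows * selectivity * complexity;
    }
}

final class LogicalCostDemo {
    public static void main(String[] args) {
        // Two equality predicates on columns with cardinality 4 and 8.
        double selectivity = LogicalCost.conjunctionSelectivity(
                LogicalCost.equalitySelectivity(4),
                LogicalCost.equalitySelectivity(8));
        int complexity = LogicalCost.complexity(2);

        System.out.println(selectivity);                                        // prints 0.03125
        System.out.println(LogicalCost.cost(1_000_000L, selectivity, complexity)); // prints 93750.0
    }
}
```

No execution feedback or hardware measurement enters the calculation, which is what makes the cost logical rather than physical.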

Related work: https://github.com/apache/pinot/issues/9055

Jul 25 '22 17:07 atris

cc @yupeng9 @Jackie-Jiang

Jul 25 '22 17:07 atris

This should be very useful. Are we planning to build a stats / metadata store as part of this, or will the cost model be applied to individual queries only, without the use of a stats store?

Jul 27 '22 17:07 amrishlal
