
cardinality estimation for filters and queries

Open patelprateek opened this issue 1 year ago • 1 comments

I'm new to the Vespa engine and am trying to use it for an internal dataset. I want to better understand some internal details so I can reason about latency and compute resource usage.

When using vector search with some constraints, how exactly is cardinality estimation done for the constraints? Is there any documentation, or are there pointers where I can read more details? I'm trying to understand how precise these estimates are: are they probabilistic with error bounds, or exact? And what is the cost of cardinality estimation compared to actually running the query?

patelprateek avatar Sep 20 '22 17:09 patelprateek

This blog post explains how Vespa performs filtering and how we estimate filters. Vespa supports both pre-filtering and post-filtering. I suggest that you start with that and update this ticket with follow-up questions.

jobergum avatar Sep 21 '22 08:09 jobergum
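
For readers unfamiliar with the two strategies mentioned above, here is a minimal sketch of the general difference between pre-filtering and post-filtering in a filtered nearest-neighbor search. It is a brute-force toy and not Vespa's implementation; the function names, the `docs` layout, and the `is_red` predicate are all hypothetical and exist only for illustration.

```python
import math

def l2(a, b):
    # Euclidean distance between two equal-length vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def pre_filter_search(docs, query, predicate, k):
    # Pre-filtering: apply the filter first, then rank only the survivors.
    candidates = [d for d in docs if predicate(d)]
    return sorted(candidates, key=lambda d: l2(d["vec"], query))[:k]

def post_filter_search(docs, query, predicate, k):
    # Post-filtering: take the k nearest overall, then drop non-matching hits.
    nearest = sorted(docs, key=lambda d: l2(d["vec"], query))[:k]
    return [d for d in nearest if predicate(d)]

# Hypothetical toy corpus: two red documents, two blue documents.
docs = [
    {"id": 1, "vec": [0.0, 0.0], "color": "red"},
    {"id": 2, "vec": [0.1, 0.0], "color": "blue"},
    {"id": 3, "vec": [0.2, 0.0], "color": "blue"},
    {"id": 4, "vec": [5.0, 5.0], "color": "red"},
]
query = [0.0, 0.0]
is_red = lambda d: d["color"] == "red"

pre = [d["id"] for d in pre_filter_search(docs, query, is_red, k=2)]
post = [d["id"] for d in post_filter_search(docs, query, is_red, k=2)]
print(pre, post)
```

In this toy, pre-filtering returns both red documents, while post-filtering returns only one because a blue document took one of the two nearest slots. That gap is the general reason an estimate of how many documents match the filter is useful when choosing between the two strategies.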

I'm closing this out, @patelprateek. Feel free to re-open if you have questions on this, or join our Slack, which is linked from https://vespa.ai/support

jobergum avatar Sep 27 '22 08:09 jobergum