lucene
lucene copied to clipboard
Apache Lucene open-source search software
cut off useless search for scorer scan
This PR add filter class for each index format classes and their writer/producer/reader/consumer and add a tracking codec class to track merge time of each index part. [Jira Issue Link](https://issues.apache.org/jira/projects/LUCENE/issues/LUCENE-10670)
### Description (or a Jira issue link if you have one) Ensure test coverage for WANDScorer by using a test query #11671
This makes incremental progress against LUCENE-10207, allowing `TermInSetQuery` to provide cost estimation so it might be used in an `IndexOrDocValuesQuery`.
### Description (or a Jira issue link if you have one) This change introduces an optimization to `TermInSetQuery` when a term is present that matches all docs in a segment.
### [Issue 1060](https://github.com/apache/lucene/issues/1060)
`KnnGraphTester` has some drawbacks and needs a refactor because: - Can only test HNSW search time on graphs created by itself (cannot work easily with existing / custom indexes) -...
https://issues.apache.org/jira/browse/LUCENE-10612 # Description Lucene93Codec should provide a way for providing custom parameters to HnswVectorsFormat # Solution For providing the various parameters to Lucene93Codec, I wrap them up in a Lucene93CodecParameters...
This commit adds a new `TermsEnumIndex` abstraction in `oal.index` that wraps a `TermsEnum` and an index of the segment that it belongs to, and can be used to create priority...
LUCENE-10207: Add "slow" term-in-set query support to SortedDocValuesField / SortedSetDocValuesField
# Description This change introduces "slow" term-in-set query support exposed through SDV and SSDV Fields. These "slow" queries can be combined with standard TermInSet queries in an IndexOrDocValues query for...