lucene
Apache Lucene open-source search software
Our sparse indexes record the min/max value on multiple levels to speed up filtering. We could store pre-aggregated data there as well to speed up faceting, e.g. sum and value...
Our queries that take advantage of sparse indexes can match many documents at once. We should create `BulkScorer`s for them, to be able to use the `Collector#collect(DocIdStream)` API and collect...
### Description The idea is to get rid of `copy()` since it does more than is needed. What's needed is an independent vector accessor over the same values source. Adrien...
### Description The server will be decomm'ed soon, we should fix these refs: ``` $ git grep people.apache.org $ git grep home.apache.org ```
Seeing this mutex contended up to O(10ms) in Elasticsearch at times. Moving to CAS and removing the unnecessary allocation of a new instance for the bitwise-and with the mask makes...
### Description Relates to https://github.com/apache/lucene/issues/12892 For global ordinal-based join, we can support concurrent search. For numeric and term-based joins, we fail if we're called from a multithreaded searcher. I can...
Issue: https://github.com/apache/lucene/issues/13802 - Many libraries based on Lucene (source: [Elasticsearch](https://github.com/elastic/elasticsearch/blob/main/modules/analysis-common/src/main/java/org/elasticsearch/analysis/common/CommonAnalysisPlugin.java#L511), [OpenSearch](https://github.com/opensearch-project/OpenSearch/blob/main/modules/analysis-common/src/main/java/org/opensearch/analysis/common/EdgeNGramTokenizerFactory.java#L54)) use NGramTokenizer.DEFAULT_MAX_NGRAM_SIZE (=`2`) instead of EdgeNGramTokenizer's (=`1`) when configuring an **EdgeNGramTokenizer**. - For this reason, it's NOT practical to...
Remove the recursion into sub-blocks when scanning a leaf block in `IDVersionSegmentTermsEnumFrame#scanToTermLeaf`.
### Description In some vector search cases, users may already know some documents that are likely related to a query. Let's support seeding HNSW's scoring stage with these documents, rather...
### Description Hello, We recently had some issues with a Polygon geometry and the following error: "Unable to Tessellate shape" when we try to create a new document with the...