lucene
lucene copied to clipboard
Apache Lucene open-source search software
### Description `node.numArcs == 0` happens on the last node(I think we fake it) of a input. But we set it `isFinal`, so we still can remove `node.numArcs == 0`.
### Description This is related to https://github.com/apache/lucene/issues/13883. The idea is to allow users to specify the RAM usage once and it will be automatically spread across N IndexWriter's so they...
### Description This PR addresses #13634. The main changes are in: - `AbstractKnnVectorQuery`, which adds a `seed` field. It scores this query if provided, and passes these seeds along to...
### Description It would take a bit of refactoring, but: ``` float dx = v - minQuantile; float dxc = Math.max(minQuantile, Math.min(maxQuantile, v)) - minQuantile; float dxs = scale *...
Resolves #13918 ### Description This introduces a `multiSelect(from, to, k[])` method on the `Selector` abstract class, and gives implementations of the method for both `Selector` implementations, `IntroSelector` and `RadixSelector`. This...
### Description This runs along the checks we already have for PR creation/update and warns us if there is no CHANGES.txt entry. Closes #13898
### Description Currently in `ScalarQuantizer`, `ScalarQuantizer.fromVectorsAutoInterval()` will issue 4 calls (per to scratch-batch, basically `len(vector)/20`) `Selector.select()` and `ScalarQuantizer.fromVectors()` will issue 2 calls. All of these 4/2 calls use the same...
There are so many projects (latest one I've heard of is Nixiesearch, [presented at Haystack](https://shuttie.github.io/haystack24-nixiesearch-slides/)) trying to read Lucene indexes from S3, let's provide a S3-based (and other object stores)...
### Description I found this [article about an interesting SIGMOD paper](https://blog.acolyer.org/2020/10/19/the-case-for-a-learned-sorting-algorithm/) about a learned sorting algorithm which outperforms Radixsort by the factor 1.49. While the implementation doesn't look trivial, it...
### Description