Shubham Chaudhary
The benchmark starts and instantiates the `TaskThreads`, which then calls the `createTasks()` method to create all the tasks (along with the included parsed query) and passes those created tasks to `TaskThread` for...
I have added a static factory method `getVectorValues` in `VectorValues` which returns the EMPTY `VectorValues` instance if the field is not found in the segment or if it's not a...
### Description Failing seed (`E9414A90E55BE2D`) generates an empty string with `TestUtil#randomUnicodeString`, whereas the two field names must not be the same as per [this](https://github.com/apache/lucene/blob/main/lucene/core/src/java/org/apache/lucene/index/FieldInfos.java#L399-L405). This fix adds an extra check that prevents this...
### Description We are currently using MurmurHash 2 ([MurmurHash64.java](https://github.com/apache/lucene/blob/main/lucene/codecs/src/java/org/apache/lucene/codecs/bloom/MurmurHash64.java)) in the bloom filter implementation in Lucene, where we also have MurmurHash 3 (the latest one available in the [MurmurHash](https://en.wikipedia.org/wiki/MurmurHash) family of...
### Description - This PR addresses #13207 to convert more classes on the `main` branch to record classes (Lucene 10 only). - It moves a lot of data classes (120...
Added 2 new tasks for multi-keyword negated queries for performance comparison. - OrNegatedHighHigh: `(-A -B -C -D)` - OrNegatedHighHighOpt: `-(A B C D)` **UPDATE:** See...
#### Description - Currently, there is no straightforward way to disable in Lucene benchmarks(?) which could be required in testing some optimizations like #258. It'd be great if we could add...
### Description Changes in this PR: 1. Makes the `getClauses(Occur occur)` method public to get the collection of one type of clauses. Without public access I had to call `getClauses` and...
We could add a tool that takes a `.vec` file containing vectors and generates a report (maybe a visualization?) about the vector distribution, min and max values, etc. This would be...
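A minimal sketch of what such a report could compute, not an existing tool. It assumes the `.vec` data is a headerless sequence of little-endian float32 values and that the vector dimension is supplied by the caller. Both are assumptions about the file layout, and `summarize_vectors` is a hypothetical name.

```python
import math
import struct


def summarize_vectors(raw: bytes, dim: int) -> dict:
    """Summarize raw vector bytes (assumed layout: little-endian float32, no header)."""
    if dim <= 0:
        raise ValueError("dim must be positive")
    if len(raw) % (4 * dim) != 0:
        raise ValueError("byte length is not a multiple of 4 * dim")

    n = len(raw) // 4                      # total number of float32 values
    values = struct.unpack(f"<{n}f", raw)  # decode every value at once
    count = n // dim                       # number of vectors
    if count == 0:
        return {"count": 0, "dim": dim}

    # Euclidean norm of each vector, useful for spotting unnormalized data.
    norms = [
        math.sqrt(sum(v * v for v in values[i * dim:(i + 1) * dim]))
        for i in range(count)
    ]
    return {
        "count": count,
        "dim": dim,
        "min": min(values),
        "max": max(values),
        "mean": sum(values) / n,
        "mean_norm": sum(norms) / count,
    }
```

For a real file the bytes would come from something like `open(path, "rb").read()`. A large file would call for chunked reads rather than loading everything at once.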
It might be useful to have a tool that associates some weight with the documents and is able to run weighted recall to give a sense of the effectiveness of the change...
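One possible reading of weighted recall, sketched under stated assumptions: each expected document carries a weight, and the score is the weight of the expected documents that were retrieved divided by the total weight of all expected documents. The definition, the function name `weighted_recall`, and the `default_weight` parameter are illustrative and not taken from the original proposal.

```python
def weighted_recall(expected_ids, retrieved_ids, weights, default_weight=1.0):
    """Share of the expected documents' total weight that was actually retrieved."""
    expected = set(expected_ids)
    if not expected:
        return 0.0

    # Documents without an explicit weight fall back to default_weight.
    total = sum(weights.get(doc, default_weight) for doc in expected)
    if total == 0:
        return 0.0

    hits = expected & set(retrieved_ids)
    found = sum(weights.get(doc, default_weight) for doc in hits)
    return found / total
```

With all weights equal this reduces to plain recall, so the two metrics can be compared directly on the same run.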