Adrien Grand issues

Results: 40 issues by Adrien Grand

Can UTF8StreamJsonParser#finishString be made faster with VarHandles?

I was looking at `UTF8StreamJsonParser#finishString` which seems to mostly consist of scanning for a trailing quote, with an optimized code path for ASCII strings. This optimized code path for ASCII...

3.x

TestIDVersionPostingsFormat failure

### Description This failure is not reproducible, which is maybe not too surprising given that the test involves concurrency. A few things are interesting: - This is the same failure...

type:test

Make Weight#scorerSupplier abstract, Weight#scorer final

### Description A long time ago, we introduced `Weight#scorerSupplier` to enable [query planning for range queries](https://www.elastic.co/blog/better-query-planning-for-range-queries-in-elasticsearch), using either points or doc values to run a range query depending on which...

type:enhancement

Improve Lucene's I/O concurrency

### Description Currently, Lucene's I/O concurrency is bound by the search concurrency. If `IndexSearcher` runs on N threads, then Lucene will never perform more than N I/Os concurrently. Unless you...

type:enhancement

Make `OneMerge#reorder` preserve blocks.

This updates `IndexWriter` to only call `OneMerge#reorder` when it has a chance to preserve the block structure, i.e. either there are no blocks or blocks are identified by a parent...

Stale

Can we decrease the overhead of skipping?

### Description On top-k queries, Lucene is now competitive with Tantivy/PISA on https://tantivy-search.github.io/bench/, but it's still considerably slower on counting queries. This made me want to run a similar experiment...

type:enhancement

Should we use a SparseFixedBitSet when deletes are sparse?

### Description @uschindler asked this question in https://lists.apache.org/thread/6o3hn3x8syfm8lj93kk5rrxb0kx701gp. In this discussion, we were looking to introduce the ability to iterate deleted docs, in order to compute (cheaply!) some facets across...

type:enhancement

Cut over HNSW's neighbor lists to group-varint?

Enable recursive graph bisection?

Use same BM25 k1/b parameters across engines.