Michael McCandless comments

Results 216 comments of


                                            Michael McCandless

[WIP]: fdb lucene-4.6.1 perftest

Hi, can you described what change you are suggesting here?

Large DocValues field retrieval test

Maybe the geo benchmarks? They use doc values for computing distance, sorting? I think it's also possible to turn on `SortedSetDVFacets`. But I don't think we have any large (`BINARY`?)...

Added FlushIndexTask to flush documents at index thread level.

Thanks @balmukundblr this looks great! Could you please open a new PR on the new Lucene GitHub repo? https://github.com/apache/lucene Thanks!

Gradual naming convention enforcement.

Another few, not sure if these also fail on mainline (though prolly we have seed shifting?): ``` [junit4:pickseed] Seed property 'tests.seed' already defined: B87E3065EF9405AA [junit4] says ᐊᐃ! Master seed: B87E3065EF9405AA...

Gradual naming convention enforcement.

Ugh sorry wrong PR!

LUCENE-8947: Skip field length accumulation when norms are disabled

I now understand @rmuir 's concern: because today we force sum of term freq within a single document to fit in `int` (during this `invertState.length` accumulation for norms), and because...

LUCENE-8947: Skip field length accumulation when norms are disabled

> > > Hmm, but I think sumTotalTermFreq, which is per field sum of all totalTermFreq across all terms in that field, could overflow long even today, in and adversarial...

LUCENE-8996: maxScore is sometimes missing from distributed responses

Hmm, I see this [src fix was committed, but the new unit test was not committed](https://github.com/apache/lucene/commit/49631ace9f1ee110d52a207377e4926baef74929) -- was that intentional?

LUCENE-7882: First idea of using Java 15 hidden anonymous classes for Lucene expressions

Whoa, thanks @uschindler -- this looks awesome. > @mikemccand can you test this with JDK 15 (release candidate) and your test. You should not see any locks anymore, speed should...

[Segment Replication] Support shard promotion.

These are awesome questions about segment replication! This is indeed a challenging situation for segrep, but is solvable with one of the three proposed options. Lucene is fundamentally a write-once...