Michael Froh
> This should work, though I'm wary of making it the new way that collectors need to interact with doc values if they want to be able to take advantage...
I was looking into how to implement this and I think I've mostly got it -- essentially, I would write the `SynonymMap` to a file (which could be an offline...
I have a (rough) PR to address this: https://github.com/apache/lucene/pull/13054. I also moved the output word lookup off-heap, but it requires a random seek (within a hopefully MMapped file) before every...
I think the `else` clause for the cost estimate is also not great. I came across this same problem where a user was essentially running a single-term `TermInSetQuery` (that actually...
I get that part of the point of this cost estimate is to avoid the (potentially-expensive) rewrite if, e.g. we can do a doc-value rewrite instead, but I'm thinking we...
Have you looked at PPL? https://opensearch.org/docs/latest/search-plugins/sql/ppl/syntax/
@opensearch-project/admin -- Can we please reassign this to the opensearch-project/sql repository? The requested capability sounds like something that should be supported by PPL. Thanks.
Thinking through this -- I still don't particularly like how complicated these cache stats have become. It feels like a lot of code for relatively little value. That said, most...
Incidentally, I think I can improve the situation for regex queries by applying the logic described in https://swtch.com/~rsc/regexp/regexp4.html
Good news!

> Incidentally, I think I can improve the situation for regex queries by applying the logic described in https://swtch.com/~rsc/regexp/regexp4.html

Once I realized that Lucene's amazing [`RegExp`](https://github.com/apache/lucene/blob/main/lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java) class will...
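
For readers unfamiliar with the linked article, its core idea is to index trigrams, use them to narrow the candidate set, and only then run the real regex. The following is a minimal Python sketch of that idea, not the implementation discussed in these comments and not Lucene code. The names `trigrams`, `build_index`, and `search` are hypothetical, and the `required_literal` argument is supplied by hand here, whereas the article derives the required trigrams from the regex itself.

```python
import re
from collections import defaultdict


def trigrams(text):
    """Return the set of all 3-character substrings of text."""
    return {text[i:i + 3] for i in range(len(text) - 2)}


def build_index(docs):
    """Map each trigram to the set of doc ids whose text contains it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for gram in trigrams(text):
            index[gram].add(doc_id)
    return index


def search(docs, index, pattern, required_literal):
    """Prefilter with trigrams, then verify candidates with the real regex.

    required_literal is an assumption of this sketch: a substring that every
    match of pattern must contain, chosen by the caller.
    """
    grams = trigrams(required_literal)
    if grams:
        # A doc can only match if it contains every trigram of the literal.
        candidates = set.intersection(*(index.get(g, set()) for g in grams))
    else:
        # Literal too short to yield a trigram: fall back to scanning all docs.
        candidates = set(range(len(docs)))
    regex = re.compile(pattern)
    return sorted(d for d in candidates if regex.search(docs[d]))


docs = [
    "Google Code Search",
    "regular expression matching",
    "code search index",
    "Lucene RegExp class",
]
index = build_index(docs)
hits = search(docs, index, r"[Cc]ode Sea+rch", "ode Sea")
```

The trigram step is only a filter: it can return false positives (which the final regex check removes) but never drops a true match, as long as the required literal really is required by the pattern.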