Paul Masurel
Paul Masurel
> Even based on your blog post that you wanted to index open-crawl, globally sorting the urls, will give you fast-field on domain,tld,subdomain,path very efficiently. - globally sorted urls will...
> Performance overhead; two lookups instead of one Considering our docvalue codec is already dynamic couldn't we remove the extra cost? If a fastfield is multivalued according to the schema...
First of all that's awesome! Thank you for driving that effort. A couple of comment. It is a bad idea to commit after adding each document. Inserts are meant to...
@Frando also, do you need help for this?
I think Lucene solution is a good idea. I would use a normal inverted list for the moment, and store parent and child docs in the same segment as follows...
@petr-tik I would not worry, and this has nothing to do with merge policies.
Thank you for stating your use case, so clearly! The part that is the most difficult is probably: (b) one or more filters of the shape "there exists a B...
FuzzyTermQuery is in tantivy. RangeQuery exists too, but they are not great as they do not rely on a specific type of indexing. They just iterate the terms in the...
@mdianjun Oops correction. Range query are handled by tantivy's query grammar.
This ticket requires to flesh out more details. - what changes in the schema definition? - what happens when a user inserts a doc with missing values?