Jon Bratseth issues

Results: 25 issues of Jon Bratseth

Use the idf value from the largest index

Today we take the idf value from the current index when needing it for relevance scoring of a document. With uncommon terms and small memory indexes this can lead to...
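To make the difference concrete, here is a minimal sketch assuming the textbook definition idf = log(N / df), which is not necessarily the formula Vespa uses. The index dictionaries, the `idf_from_largest` helper, and the example counts are all hypothetical and only illustrate how an estimate from a small index can diverge from one taken from the largest index.

```python
import math

def idf(total_docs, docs_with_term):
    # Textbook inverse document frequency (an assumption, not Vespa's exact formula).
    return math.log(total_docs / max(docs_with_term, 1))

def idf_from_largest(indexes, term):
    # Pick the index holding the most documents and compute idf from its statistics.
    largest = max(indexes, key=lambda ix: ix["total_docs"])
    return idf(largest["total_docs"], largest["doc_freq"].get(term, 0))

# Hypothetical statistics: a small memory index and a large disk index.
memory_index = {"total_docs": 10, "doc_freq": {"zyzzyva": 1}}
disk_index = {"total_docs": 1_000_000, "doc_freq": {"zyzzyva": 100}}

idf_current = idf(memory_index["total_docs"], memory_index["doc_freq"]["zyzzyva"])
idf_largest = idf_from_largest([memory_index, disk_index], "zyzzyva")
print(f"current index: {idf_current:.2f}, largest index: {idf_largest:.2f}")
```

With an uncommon term, the small index has very few observations to base its estimate on, so the two values can differ considerably.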

Handle adding more new groups than the amount you already have

We currently take the median number of documents in the other groups to decide if a given group should be in rotation. This does not work when a majority of...
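A rough sketch of a median-based check, assuming a group counts as in rotation when it holds at least some fraction of the median document count of the other groups. The `min_fraction` threshold and both function names are hypothetical, and this is not Vespa's actual implementation; it only illustrates why the title's scenario is awkward for a median.

```python
from statistics import median

def in_rotation(group_docs, other_docs, min_fraction=0.9):
    # Hypothetical rule: compare this group against the median of the other groups.
    return group_docs >= min_fraction * median(other_docs)

def rotation_status(doc_counts, min_fraction=0.9):
    # Evaluate every group against the remaining groups.
    return [
        in_rotation(count, doc_counts[:i] + doc_counts[i + 1:], min_fraction)
        for i, count in enumerate(doc_counts)
    ]

# Three populated groups plus one new, empty group: the empty group is excluded.
few_new = rotation_status([1000, 1000, 1000, 0])

# One populated group plus three new, empty groups: the median of the others
# is 0 for every group, so under this rule the empty groups pass the check too.
many_new = rotation_status([1000, 0, 0, 0])
```

Under these assumptions, once new groups outnumber the existing ones, the median is dominated by the empty groups and stops being a useful reference.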

Configurable max token length

If clients feed very long tokens, or long text in fields without tokenization (match:exact/word), Vespa will produce an index containing them which is wasteful and possibly damaging. - Add a...

Stem prefix items

If we are searching a stemmed index, it's probably better to stem terms also when we are searching for prefixes.

Support userQuery/userInput in the JSON query format

Make the gram root settable

Less work to support it than to keep explaining how to do it in your app.

Exclude fields in a document summary

If you have lots of summary-enabled fields and you want a summary containing most, but not all, of them (e.g. not vectors), you need to repeat lots of field names....

enhancement

Bratseth/pack bits

@geirst please review. Sorry for the many commits, this was a bit exploratory. Now that we have contextual type resolving, I want to replace the earlier half-baked attempts at type...

General elementwise operator in ranking

Vespa provides the feature elementwise(bm25(field),dimension,cell_type) to calculate a text match score for each element in an array string field. Generalize this to support elementwise over any rank function, where features...

enhancement

Allow documentid to be turned into an attribute

Today, indexed Vespa documents have a special field `documentid` to which the summary transform `documentid` is applied, such that the field is populated with the full string document id, when...