Jan Niestadt

Results 154 comments of Jan Niestadt

This is not a performance critical part of the code, so clarity is more important in my opinion.

(when we integrate with Solr, this would have to be added to the Lucene index as well, not a separate file)

The new integrated index format will store the input format config inside the index metadata. We will probably deal with the old AutoSearch corpora by using a proxy that forwards...

We don't really use numeric search fields ourselves (other than for a few bookkeeping fields we don't search on), but they should work of course, so this is a bug....

This took way too long. I've looked into it now and I can see that numeric fields are completely broken. The field type is read from the configuration, but is...

Note that this is different from the HitGroupsTokenFrequencies code - that makes specific groupings that don't store hits possible. This is about not storing hits at all when grouping is...

Good idea, and shouldn't be too difficult I think. I might have a look soon. I'm not sure if tagging different parts of the query with the same group name...

https://github.com/INL/BlackLab/tree/group-by-capture Work in progress.

Should work, please test. :-) (e.g. group by property `capture:word:i:X` to group on named capture `X`)

That's above my pay grade. ;-)