BlackLab
BlackLab copied to clipboard
Linguistic search for large annotated text corpora, based on Apache Lucene
A refactoring of BLS was started with PR #357, to allow us to reimplement the various requests in Solr as well without duplicating code. This is a good first step,...
Copied comment from [this PR](https://github.com/INL/BlackLab/pull/352). Are all captured group names always registered correctly? Could this code be made more transparent? The issue is HitQueryContext being used per Spans to determine...
zlib support preset dictionaries, which is a way to improve compression if you know something about the structure of your data ahead of time. See https://www.ietf.org/rfc/rfc1950.txt In our case, using...
Hello, when I using 'termfreq' by this request: `http://localhost:8080/blacklab-server/my-index/termfreq?number=1000&outputformat=json`, and I got the response like this: ``` { "termFreq": { "1": 527, "2": 446, "3": 287, "4": 206, "5": 142,...
Dash and period in field, annotation and tag names were not allowed (partially solved now, see below). E.g. this query did not parse: containing "dog" but this one does: containing...
This is a broad and confusing error to get. Should ideally distinguish between these cases: - no `blacklab-server.yaml` could be found. Include link to docs, hints. - `blacklab-server.yaml` was found,...
Now that we correctly keep track of each corpus' `timeModified` (see #340), we could use it to improve client-side caching. E.g. use [If-Modified-Since](https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/If-Modified-Since)), as those should not change as long...
The current default on the dev branch (and for upcoming BlackLab 4.0), API v4, is transitional: - 98% compatible with v3 - adds new `/corpora/NAME` endpoints There's also the experimental...
As reported by @PrinsINT Attached input format and resulting xslt attached. The offending config is: ```yaml - name: word displayName: Word valuePath: if (./seg) then ./seg else . ``` Resulting...
e.g. if your config shows you're trying to index relations, but you're using VTD instead of Saxon, give an error message explaining the problem.