BlackLab

Linguistic search for large annotated text corpora, based on Apache Lucene

Results 103 BlackLab issues

A refactoring of BLS was started with PR #357, to allow us to reimplement the various requests in Solr as well without duplicating code. This is a good first step,...

refactor

Copied comment from [this PR](https://github.com/INL/BlackLab/pull/352). Are all captured group names always registered correctly? Could this code be made more transparent? The issue is HitQueryContext being used per Spans to determine...

refactor

zlib supports preset dictionaries, a way to improve compression if you know something about the structure of your data ahead of time (see https://www.ietf.org/rfc/rfc1950.txt). In our case, using...

enhancement
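
To make the preset-dictionary idea concrete, here is a minimal sketch using the JDK's `java.util.zip.Deflater` and `Inflater`, which expose zlib's dictionary support through `setDictionary`. The class name, the helper methods and the sample dictionary contents are illustrative assumptions, not BlackLab code; the issue text above does not say what a real dictionary would contain.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

// Hypothetical demo class; not part of BlackLab.
final class PresetDictionaryDemo {

    /** Compress input as a zlib stream, priming the compressor with a preset dictionary. */
    static byte[] compress(byte[] input, byte[] dictionary) {
        Deflater deflater = new Deflater(Deflater.BEST_COMPRESSION);
        try {
            deflater.setDictionary(dictionary); // must be set before any data is compressed
            deflater.setInput(input);
            deflater.finish();
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buffer = new byte[1024];
            while (!deflater.finished()) {
                int n = deflater.deflate(buffer);
                out.write(buffer, 0, n);
            }
            return out.toByteArray();
        } finally {
            deflater.end();
        }
    }

    /** Decompress a zlib stream, supplying the same dictionary when the stream asks for it. */
    static byte[] decompress(byte[] compressed, byte[] dictionary) {
        Inflater inflater = new Inflater();
        try {
            inflater.setInput(compressed);
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buffer = new byte[1024];
            while (!inflater.finished()) {
                int n = inflater.inflate(buffer);
                if (n > 0) {
                    out.write(buffer, 0, n);
                } else if (inflater.needsDictionary()) {
                    // The stream header says a preset dictionary was used.
                    inflater.setDictionary(dictionary);
                } else if (inflater.needsInput()) {
                    throw new IllegalStateException("Truncated zlib stream");
                }
            }
            return out.toByteArray();
        } catch (DataFormatException e) {
            throw new IllegalStateException("Corrupt zlib stream", e);
        } finally {
            inflater.end();
        }
    }

    public static void main(String[] args) {
        // Assumed dictionary: byte sequences expected to recur in the data.
        byte[] dictionary = "<w lemma= pos= >NOUN VERB ADJ</w>".getBytes(StandardCharsets.UTF_8);
        byte[] input = "<w lemma=dog pos=NOUN>dogs</w>".getBytes(StandardCharsets.UTF_8);
        byte[] packed = compress(input, dictionary);
        byte[] unpacked = decompress(packed, dictionary);
        System.out.println("round trip ok: " + Arrays.equals(input, unpacked));
        System.out.println("compressed size: " + packed.length + " bytes");
    }
}
```

Both sides must hold exactly the same dictionary bytes: the zlib header carries only a checksum of the dictionary, not the dictionary itself, so a changed dictionary makes previously written data unreadable.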

Hello, when I use 'termfreq' with this request: `http://localhost:8080/blacklab-server/my-index/termfreq?number=1000&outputformat=json`, I get a response like this: ``` { "termFreq": { "1": 527, "2": 446, "3": 287, "4": 206, "5": 142,...

Dash and period in field, annotation and tag names were not allowed (partially solved now, see below). E.g. this query did not parse: containing "dog" but this one does: containing...

enhancement
indexing

This is a broad and confusing error to get. Should ideally distinguish between these cases:
- no `blacklab-server.yaml` could be found. Include link to docs, hints.
- `blacklab-server.yaml` was found,...

documentation
robustness

Now that we correctly keep track of each corpus' `timeModified` (see #340), we could use it to improve client-side caching. E.g. use [If-Modified-Since](https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/If-Modified-Since), as those should not change as long...

enhancement
performance
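
As a rough illustration of the caching idea, the sketch below decides whether a request carrying an `If-Modified-Since` header could be answered with `304 Not Modified`, given a corpus modification time. The class and method names are hypothetical, and representing `timeModified` as a `java.time.Instant` is an assumption; how BlackLab Server actually stores or exposes that value is not stated above.

```java
import java.time.Instant;
import java.time.ZonedDateTime;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.time.temporal.ChronoUnit;

// Hypothetical helper; not part of BlackLab Server.
final class ConditionalGetSketch {

    /**
     * Returns true if the client's cached copy is still current, i.e. the
     * server could reply 304 Not Modified instead of recomputing the response.
     */
    static boolean notModified(String ifModifiedSinceHeader, Instant corpusTimeModified) {
        if (ifModifiedSinceHeader == null) {
            return false; // unconditional request: send the full response
        }
        try {
            Instant since = ZonedDateTime
                    .parse(ifModifiedSinceHeader, DateTimeFormatter.RFC_1123_DATE_TIME)
                    .toInstant();
            // HTTP dates have one-second resolution, so drop sub-second precision first.
            Instant modified = corpusTimeModified.truncatedTo(ChronoUnit.SECONDS);
            return !modified.isAfter(since);
        } catch (DateTimeParseException e) {
            return false; // unparseable header: treat the request as unconditional
        }
    }

    public static void main(String[] args) {
        Instant modified = Instant.parse("2015-10-21T07:28:00Z");
        System.out.println(notModified("Wed, 21 Oct 2015 07:28:00 GMT", modified));
        System.out.println(notModified("Tue, 20 Oct 2015 07:28:00 GMT", modified));
    }
}
```

This only makes sense for responses that depend solely on the corpus contents; anything that also depends on server configuration or user state would need a different validator.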

The current default on the dev branch (and for upcoming BlackLab 4.0), API v4, is transitional:
- 98% compatible with v3
- adds new `/corpora/NAME` endpoints

There's also the experimental...

webservice

As reported by @PrinsINT. Input format and resulting XSLT attached. The offending config is:
```yaml
- name: word
  displayName: Word
  valuePath: if (./seg) then ./seg else .
```
Resulting...

bug

E.g. if your config shows you're trying to index relations but you're using VTD instead of Saxon, give an error message explaining the problem.

indexing