BlackLab
BlackLab copied to clipboard
Linguistic search for large annotated text corpora, based on Apache Lucene
The following queries should have identical results: ``` n:_ -case-> v1:_; -case-> v2:_ :: start(v1) < start(n) & start(n) < start(v2) n:_ -case-> v1:_; -case-> v2:_ :: start(v2) < start(n)...
A lot from plan-relations.md can be reused for the final documentation.
BlackLab should trim values while indexing by default, otherwise an XML structure like this will index a word ending in a newline: ```xml ende ```
We currently only CI test JSON responses, but we rely on certain XML responses as well in corpus-frontend. We should at least test these XML responses to ensure they don't...
`IntArrays.parallelQuickSort()` really sped up sorting terms while writing them to disk or while reconstructing the global terms list. Sorting hits by a `HitProperty` seems to still be done single-threaded in...
With a current checkout of the dev branch and Java 11: ```sh $ git describe v4-alpha2-34-g16ef16df $ java -version openjdk version "11.0.20.1" 2023-08-24 OpenJDK Runtime Environment Temurin-11.0.20.1+1 (build 11.0.20.1+1) OpenJDK...
The indexing system has grown complex over the years, and some features aren't very useful, or aren't needed anymore if you use Saxon with XPath 3 support. We should deprecate...
Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects) from 1.15.2 to 1.15.4. Commits 6585820 Release version 1.15.4 of the npm package. 7a6567e Disallow bracketed hostnames. 05629af Prefer native URL instead of deprecated url.parse. 1cba8e8 Prefer native...