BlackLab
BlackLab copied to clipboard
Linguistic search for large annotated text corpora, based on Apache Lucene
Right now, BlackLab doesn't check the configured pid field when adding documents. So it's possible to add a document twice; two copies will exist in BlackLab that have the same...
(see https://github.com/INL/corpus-frontend/issues/527) `/docs/DOCID/contents` should add a hit number to `` tags, which will allow the frontend to deal with overlapping hits and hits that e.g. span across `` tags. ```xml...
Thanks to @eduarddrenth for this branch, which updates our previous experiment and solves more issues. CURRENT STATUS: working, experimental. Will probably be merged in after releasing v4 soon Old comments:...