cogcomp-nlp
cogcomp-nlp copied to clipboard
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, t...
similarity/src/main/java/edu/illinois/cs/cogcomp/nesim/io/StandardInputOutput.java similarity/src/main/java/edu/illinois/cs/cogcomp/wsim/wordnet/Util.java
Unless your sentence is very simple, nom-srl almost always returns null.
[Line 156](https://github.com/CogComp/cogcomp-nlp/blob/master/pipeline/src/main/java/edu/illinois/cs/cogcomp/pipeline/server/ServerClientAnnotator.java#L156) causes some un-intended side-effects. Since the ServerClientAnnotator creates a temporary TextAnnotation (from the server response), the new view that is added still refers to `newTA`. Consider the following...
@mssammon do we have any reason to have [this function](https://github.com/CogComp/cogcomp-nlp/blob/master/core-utilities/src/main/java/edu/illinois/cs/cogcomp/annotation/BasicAnnotatorService.java#L366-L392), in addition to the `addView` function?
@mssammon We briefly discussed having tests / evaluations for tokenizer. Thoughts how hard/easy that might be? If we have the data, I can have a look.
I just tried to use an annotator and it failed (at least) because of MapDB issues. We should test this and have proper instructions on how to parallelize cached annotators.
Move datatore here, since it's already been used in different places here.
Each time we add a constituent to the SpanLabelView, we sort the results after appending the new constituent. This appears to be massively inefficient for very large files. When create...
Same as #292
Keeping tokenization issues here to be fixed in future: -- `Ph.D.` gets tokenized into `Ph.`, `.`, `D.`, while it should be `Ph.D.`: ``` When his son was still an infant,...