cogcomp-nlp

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, t...

Results 104 cogcomp-nlp issues

similarity/src/main/java/edu/illinois/cs/cogcomp/nesim/io/StandardInputOutput.java similarity/src/main/java/edu/illinois/cs/cogcomp/wsim/wordnet/Util.java

Unless your sentence is very simple, nom-srl almost always returns null.

[Line 156](https://github.com/CogComp/cogcomp-nlp/blob/master/pipeline/src/main/java/edu/illinois/cs/cogcomp/pipeline/server/ServerClientAnnotator.java#L156) causes some unintended side effects. Since the ServerClientAnnotator creates a temporary TextAnnotation (from the server response), the new view that is added still refers to `newTA`. Consider the following...

@mssammon do we have any reason to have [this function](https://github.com/CogComp/cogcomp-nlp/blob/master/core-utilities/src/main/java/edu/illinois/cs/cogcomp/annotation/BasicAnnotatorService.java#L366-L392), in addition to the `addView` function?

@mssammon We briefly discussed having tests / evaluations for the tokenizer. Thoughts on how hard or easy that might be? If we have the data, I can have a look.

I just tried to use an annotator and it failed (at least) because of MapDB issues. We should test this and have proper instructions on how to parallelize cached annotators.

Move datastore here, since it is already used in several places in this repository.

Each time we add a constituent to the SpanLabelView, we sort the results after appending the new constituent. This appears to be massively inefficient for very large files. When create...
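
To illustrate the cost pattern described in this issue, here is a minimal Java sketch. It does not use the real `SpanLabelView` or `Constituent` classes: `Span`, `sortEachAdd`, and `sortOnce` are hypothetical stand-ins introduced only for this example. The first method re-sorts after every append, which needs at least one pass over the list per insertion and so grows roughly quadratically with the number of spans. The second appends everything and sorts once. Whether a one-time sort fits the library's actual invariants is an assumption, not something the issue confirms.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class SpanSortSketch {
    /** Hypothetical stand-in for a constituent: a labeled token span. */
    static final class Span {
        final int start;
        final int end;
        final String label;

        Span(int start, int end, String label) {
            this.start = start;
            this.end = end;
            this.label = label;
        }

        @Override
        public String toString() {
            return start + "-" + end + ":" + label;
        }
    }

    /** Order spans by start offset, then by end offset. */
    static final Comparator<Span> BY_POSITION =
            Comparator.<Span>comparingInt(s -> s.start).thenComparingInt(s -> s.end);

    /** Pattern from the issue: append, then re-sort the whole list each time. */
    static List<Span> sortEachAdd(List<Span> input) {
        List<Span> out = new ArrayList<>();
        for (Span s : input) {
            out.add(s);
            out.sort(BY_POSITION); // one full sort call per insertion
        }
        return out;
    }

    /** Alternative sketch: append everything first, then sort a single time. */
    static List<Span> sortOnce(List<Span> input) {
        List<Span> out = new ArrayList<>(input);
        out.sort(BY_POSITION); // one sort call in total
        return out;
    }

    public static void main(String[] args) {
        List<Span> input = new ArrayList<>();
        input.add(new Span(5, 7, "B"));
        input.add(new Span(0, 2, "A"));
        input.add(new Span(3, 4, "C"));

        // Both strategies produce the same ordering; only the work done differs.
        System.out.println(sortEachAdd(input));
        System.out.println(sortOnce(input));
    }
}
```

Both methods return the same ordered list, so the choice only affects how much sorting work is done while a large file is being loaded.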

Keeping tokenization issues here to be fixed in future: -- `Ph.D.` gets tokenized into `Ph.`, `.`, `D.`, while it should be `Ph.D.`: ``` When his son was still an infant,...