cogcomp-nlp
cogcomp-nlp copied to clipboard
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, t...
I cannot succesfully execute the script scripts/runPipelineOnDataset.sh, I see the following exceptions. Am I doing something wrong? Thank you in advance `java.io.InvalidClassException: edu.illinois.cs.cogcomp.sl.core.SLModel; local class incompatible: stream classdesc serialVersionUID =...
@lingss0918 what is the best way to get vectors (or any low-level representations) for words, in the similarity package? Would be great if we have something on that in the...
@Clouddd @666666fzl can you add sample outputs to the readme of the temporal normalizer? Like for a sample input, what kind of output do we expect from your system? What...
Throws exceptions for many inputs. Would be nice to understand why. Here is an example input: ``` A ( n ) beneficial relationship exists when a certain type of tree...
The "testSentencePipeline" in the "SentencePipelineTest" https://github.com/CogComp/cogcomp-nlp/blob/master/pipeline/src/test/java/edu/illinois/cs/cogcomp/pipeline/main/SentencePipelineTest.java#L89 leads to an assertion error "java.lang.AssertionError: Expecting token: The, found Th instead." at https://github.com/CogComp/cogcomp-nlp/blob/master/core-utilities/src/main/java/edu/illinois/cs/cogcomp/core/datastructures/textannotation/TreeView.java#L543
We have a few tests here which are ignored. Do we want to un-ignore them? https://github.com/CogComp/cogcomp-nlp/blob/8193e054b8f8172f924fe82a7ad47d86c07df9f0/similarity/src/test/java/edu/illinois/cs/cogcomp/mrcs/LlmSimTest.java#L37-L63
https://github.com/CogComp/cogcomp-nlp/blob/master/ner/src/main/java/edu/illinois/cs/cogcomp/ner/StringStatisticsUtils/MyString.java#L48 df.parse(s,pp) is probably not thread-safe when reusing ParsePosition on different strings. Calling this function concurrently cause some undefined behaviors (unexpected exceptions) for me. It may be safer to add...
We have some embeddings on `/shared/corpora/`. Not all of these are included in the similarity package. Wondering if we can included them in the similarity project: ``` drwxrwsrwx 2 upadhya3...
https://github.com/CogComp/cogcomp-nlp/blob/master/core-utilities/src/main/java/edu/illinois/cs/cogcomp/core/utilities/JsonSerializer.java#L463 This method requires that strings contain only characters corresponding to tokens. For a transformed text, this means a version of that transformation must be maintained and used here, or...
test will fail with unexpected format error if TextAnnotation or other dependency changes.