cogcomp-nlp icon indicating copy to clipboard operation
cogcomp-nlp copied to clipboard

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, t...

Results 104 cogcomp-nlp issues
Sort by recently updated
recently updated
newest added

@mssammon In `Annotator`, getView and addView functions seem to be doing the same thing (one lazy and one not-lazy). Is that really the difference, or anything else I'm missing?

The `mvn install` process fails on transliteration with error `error: unmappable character for encoding Cp1252` on https://github.com/CogComp/cogcomp-nlp/blob/master/transliteration/src/main/java/edu/illinois/cs/cogcomp/transliteration/Program.java#L508

Imagine you want to pas some options to an annotator, through pipeline or any other annotator service. How can we do this? We discussed something similar to this, but when...

Currently [VerbSense is using an old version of SL](https://github.com/CogComp/cogcomp-nlp/blob/master/verbsense/pom.xml#L70-L72) which gets overwritten by a newer version when trying to access it in the pipeline. (hence verb-sense doesn't work in the...

We get a ton of collisions on the hashCode custom produced by the Constituent class.

Why we never talk about the metrics in the similarity module? Like we should let users choose their metric e.g. cosine, etc. ...

In the similarity module add a table on the size of the vocabulary for each similarity resource.

``` 2013-08-05T22: 47: 53 China's Ministry of Public Security to crack down on the use of "pseudo-base station" to implement illegal and criminal activities arrested 217 suspects liuyizhan China's Ministry...

This example: "This vulnerability happened in Firefox 1.x before 1.8, Safari 2.x before 2.8." not able to recognize that 1.x and 2.x is one single number.

@Slash0BZ do you think this has been affected by your change in tokenization? ![screen shot 2017-09-08 at 9 47 17 am](https://user-images.githubusercontent.com/2441454/30222083-c7545a32-947a-11e7-9d90-022a326d4e69.png) "E.coli" should be one token, I think.