CoreNLP icon indicating copy to clipboard operation
CoreNLP copied to clipboard

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Results 152 CoreNLP issues
Sort by recently updated
recently updated
newest added

What are the training materials for your Chinese model? Are they newspaper news or are they social media texts? Or maybe there are other sources.

Hello everyone, I'm trying to use this dependency from my AEM project (maven based) but the dependency is not being installed because I have the following error: `Bundle-SymbolicName header missing,...

I have a test that I run on `CoreMapExpressionExtractor` which relies on caching the result of an annotation run so I don't have to run the parser each time. I...

The following snippet of code seems to correctly split on the hyphen in "year-end" in 3.9.2, but no longer in 4.4.0. Is this expected behavior? ```java public static void main(String[]...

cleanup
fixed on dev

I think we should update the new version for testing. At present, we are using CoreNLP. I will add some interesting tests later, including all languages. Thanks

I have a token which is recognized as others in some classifiers however the same token is identified as Person or Organization. In such a case the entity need to...

What are the current best practices for converting OntoNotes 5.0 to UD format? I didn't find any documentation or issues about this, sorry if it was already asked. I used...

Hi guys, I am receiving several errors that is continually appearing every few minutes and I am not sure why, can someone please tell me what these following errors maybe...

Is it available to use CoreNLP models or any model developed by Stanford NLP group on constituency parsing in Spanish?

PTBTokenizer crashed on this unicode character (U+2063, decimal: 8291) which is an invisible comma/separator, and threw this error: > Untokenizable: ⁣ (U+2063, decimal: 8291) Exception in thread “main” java.lang.ArithmeticException: integer...

tokenize