stanza icon indicating copy to clipboard operation
stanza copied to clipboard

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Results 174 stanza issues
Sort by recently updated
recently updated
newest added

sorry for flooding the board with my issues, but here's another one with Stanza main branch. Please note that this is raised with English default model: I have processed files...

bug
fixed on dev

Similarly to the start_char and end_char information, it would be useful to write down annotations such as NER in the MISC column while calling convert_token_dict() Now the default is start_char/end_char,...

enhancement
pinned

For example, if a sentence ends with `thicc` and no sentence final punctutation, `thicc` is labeled `PUNCT`

bug

When using an English tokenizer with MWT (such as EWT, which treats possessive clitics as MWT), the NER model puts the tags on the entire token, not just the word,...

bug

NER models start from a pretrained embedding, same as depparse, pos, etc however, they finetune the embedding the entire finetuned embedding is then saved in the model. only the delta...

enhancement

I am trying to create a CoreNLPClient instance based on the models for German, but with the parameter `tokenize.whitespace = true`. It should perform POS-Tagging for German Text. In the...

question
pinned

**Describe the bug** Timeout when executing Semgrex query happens on long strings. This happens with the `stanfordnlp` library, but not the deprecated `python-stanford-corenlp` library. **To Reproduce** Starting server with: `java...

bug

**Describe the bug** The CoreNLP server is not stopped automatically after the `with` statement in Python is finished. This happens during interactive Python sessions and running Python scripts as a...

bug

**Describe the bug** We're using Stanza in a package called [netts](https://github.com/alan-turing-institute/netts). However, on some windows machines, we're finding the CoreNLP server does not shut down correctly (port is not freed)....

bug
awaiting feedback

* Currently, stanza produces universal dependencies for many languages, It would be great if it could be extended to augment the resulting universal dependencies with enhancements (Just like CoreNLP's Enhanced/Enhanced++/CCprocessed...

enhancement
pinned