dkpro-core icon indicating copy to clipboard operation
dkpro-core copied to clipboard

Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.

Results 160 dkpro-core issues
Sort by recently updated
recently updated
newest added

``` Tagset used by hu_szeged_kr.model.gz is not known. Currently recorded as "unknown" in the model meta data. ``` Original issue reported on code.google.com by `richard.eckart` on 2014-01-12 19:02:03

🐛Bug
Module-hunpos

``` Extracting tagsets from HunPos models not supported ``` Original issue reported on code.google.com by `richard.eckart` on 2014-01-12 19:00:57

🐛Bug
Module-hunpos

``` StanfordNamedEntityRecognizer does not use existing tokenization. The annotations created by it may not always be colocated with tokens! ``` Original issue reported on code.google.com by `richard.eckart` on 2013-09-16 10:31:15

🐛Bug
Module-stanfordnlp

``` POS tagset extracted from French maltparser model looks very strange: Tagset [null] for layer [de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.pos.POS] contains [39] tags: /CC /P /PONCT 4/DET ADJ ADJWH ADV ADVWH CC CLO CLR...

🐛Bug
Module-maltparser

``` This seems a bit to strict for most purposes. Should be generalized better. ``` Original issue reported on code.google.com by `torsten.zesch` on 2013-08-05 11:20:38

🐛Bug

``` As John Bauer commented regarding the fetching of the dependency relation tagset: One thing worth noting is that the dependencies list can actually change over time as it comes...

🐛Bug
Module-stanfordnlp

``` In the current trunk, JWPL has changed from using its own parser to using the SWEBLE parser. The old parser is still available in its own module and is...

🐛Bug
Module-io.jwpl

``` Currently Stem and Lemma are defined in the Segmentation API. Arguably, they don't have anything to do with that API other than being used as features in Token. The...

🐛Bug
Module-api.lexmorph

``` Snowball comes with a set of standard stopword lists. Per default the tagger should detect which language a document has and use the standard list for that language. It...

🐛Bug
Module-stopwordremover

The `NorvigSpellingCorrector` does not use the uima-fit JCasAnnotator_ImplBase import, which prevents providing a corpus file for the in-vocabulary words (parameter is always null). This bug breaks the module.

🐛Bug