dkpro-core issues

Tagset used by hu_szeged_kr.model.gz is not known

``` Tagset used by hu_szeged_kr.model.gz is not known. Currently recorded as "unknown" in the model metadata. ``` Original issue reported on code.google.com by `richard.eckart` on 2014-01-12 19:02:03

reckart

🐛Bug

Module-hunpos

Extracting tagsets from HunPos models not supported

1

``` Extracting tagsets from HunPos models not supported ``` Original issue reported on code.google.com by `richard.eckart` on 2014-01-12 19:00:57

reckart

🐛Bug

Module-hunpos

StanfordNamedEntityRecognizer does not use existing tokenization

1

``` StanfordNamedEntityRecognizer does not use existing tokenization. The annotations created by it may not always be colocated with tokens! ``` Original issue reported on code.google.com by `richard.eckart` on 2013-09-16 10:31:15

reckart

🐛Bug

Module-stanfordnlp
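
The colocation problem described in the StanfordNamedEntityRecognizer issue above can be checked mechanically. The following is a minimal sketch, not DKPro Core code: the class `SpanAlignment`, its `Span` type, and the method `isAlignedWithTokens` are hypothetical names, and plain character offsets stand in for UIMA annotations so the example stays self-contained. It treats an entity as colocated with tokens when it begins where some token begins and ends where some token ends.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper for illustration only; not part of DKPro Core.
class SpanAlignment {
    /** A half-open character span [begin, end). */
    static final class Span {
        final int begin;
        final int end;

        Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }
    }

    /** True if the entity starts at some token's begin and ends at some token's end. */
    static boolean isAlignedWithTokens(Span entity, List<Span> tokens) {
        Set<Integer> begins = new HashSet<>();
        Set<Integer> ends = new HashSet<>();
        for (Span t : tokens) {
            begins.add(t.begin);
            ends.add(t.end);
        }
        return begins.contains(entity.begin) && ends.contains(entity.end);
    }

    public static void main(String[] args) {
        // "New York-based firm" tokenized as: New | York-based | firm
        List<Span> tokens = Arrays.asList(new Span(0, 3), new Span(4, 14), new Span(15, 19));
        // Entity covering whole tokens "New York-based"
        System.out.println(isAlignedWithTokens(new Span(0, 14), tokens)); // prints true
        // Entity "New York" ends in the middle of the token "York-based"
        System.out.println(isAlignedWithTokens(new Span(0, 8), tokens));  // prints false
    }
}
```

A check of this kind only detects misaligned annotations; it does not make the recognizer reuse an existing tokenization.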

POS tagset extracted from French maltparser model looks very strange

1

``` POS tagset extracted from French maltparser model looks very strange: Tagset [null] for layer [de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.pos.POS] contains [39] tags: /CC /P /PONCT 4/DET ADJ ADJWH ADV ADVWH CC CLO CLR...

reckart

🐛Bug

Module-maltparser

XmlReader assumes all elements to be on the second level

1

``` This seems a bit too strict for most purposes. Should be generalized better. ``` Original issue reported on code.google.com by `torsten.zesch` on 2013-08-05 11:20:38

reckart

🐛Bug
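
To illustrate the restriction named in the XmlReader issue above, here is a small sketch using only the JDK's DOM API. It does not reproduce XmlReader's actual implementation; the class `ElementDepthSketch`, both method names, and the sample XML are assumptions made for this example. One method looks only at direct children of the root (the "second level"), the other finds matching elements at any depth.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical illustration; not the XmlReader implementation.
class ElementDepthSketch {
    private static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    /** Text of matching elements that are direct children of the root element. */
    static List<String> secondLevelOnly(String xml, String tag) {
        List<String> out = new ArrayList<>();
        NodeList children = parse(xml).getDocumentElement().getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node n = children.item(i);
            if (n instanceof Element && tag.equals(n.getNodeName())) {
                out.add(n.getTextContent());
            }
        }
        return out;
    }

    /** Text of matching elements at any depth, in document order. */
    static List<String> anyDepth(String xml, String tag) {
        List<String> out = new ArrayList<>();
        NodeList hits = parse(xml).getElementsByTagName(tag);
        for (int i = 0; i < hits.getLength(); i++) {
            out.add(hits.item(i).getTextContent());
        }
        return out;
    }

    public static void main(String[] args) {
        String xml = "<corpus><doc>a</doc><section><doc>b</doc></section></corpus>";
        System.out.println(secondLevelOnly(xml, "doc")); // prints [a]
        System.out.println(anyDepth(xml, "doc"));        // prints [a, b]
    }
}
```

The nested `doc` element is missed by the second-level lookup and found by the depth-independent one.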

Reported dependency tagset may be incomplete

2

``` As John Bauer commented regarding the fetching of the dependency relation tagset: One thing worth noting is that the dependencies list can actually change over time as it comes...

reckart

🐛Bug

Module-stanfordnlp

Migrate all readers from the JWPL Parser to SWEBLE

2

``` In the current trunk, JWPL has changed from using its own parser to using the SWEBLE parser. The old parser is still available in its own module and is...

reckart

🐛Bug

Module-io.jwpl

Stem and Lemma should be defined in LexMorph API

11

``` Currently Stem and Lemma are defined in the Segmentation API. Arguably, they don't have anything to do with that API other than being used as features in Token. The...

reckart

🐛Bug

Module-api.lexmorph

Automatically use standard stopword lists depending on document language

10

``` Snowball comes with a set of standard stopword lists. By default the tagger should detect which language a document has and use the standard list for that language. It...

reckart

🐛Bug

Module-stopwordremover

NorvigSpellingCorrector uses wrong JCasAnnotator_ImplBase import

The `NorvigSpellingCorrector` does not import `JCasAnnotator_ImplBase` from uimaFIT. As a result, the parameter that supplies the corpus file of in-vocabulary words is never set and is always null. This bug breaks the module.

Horsmann

🐛Bug
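
Why the base class matters for the NorvigSpellingCorrector issue above can be shown with a dependency-free simulation. This sketch does not use UIMA or uimaFIT: the annotation `Param`, the base classes `PlainBase` and `InjectingBase`, the two corrector classes, and the parameter name `corpusFile` are all hypothetical stand-ins. It assumes the general pattern in which one base class fills annotated fields during `initialize()` while another leaves them untouched.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Field;
import java.util.Collections;
import java.util.Map;

// Hypothetical simulation; none of these types come from UIMA or uimaFIT.
class ParameterInjectionSketch {
    /** Stand-in for an annotation that marks a configurable field. */
    @Retention(RetentionPolicy.RUNTIME)
    @Target(ElementType.FIELD)
    @interface Param {
        String name();
    }

    /** Stand-in for a base class that ignores annotated fields. */
    abstract static class PlainBase {
        void initialize(Map<String, String> settings) {
            // No injection happens here, so annotated fields keep their defaults.
        }
    }

    /** Stand-in for a base class that fills annotated fields during initialize(). */
    abstract static class InjectingBase {
        void initialize(Map<String, String> settings) {
            for (Field f : getClass().getDeclaredFields()) {
                Param p = f.getAnnotation(Param.class);
                if (p == null) {
                    continue;
                }
                try {
                    f.setAccessible(true);
                    f.set(this, settings.get(p.name()));
                } catch (IllegalAccessException e) {
                    throw new IllegalStateException(e);
                }
            }
        }
    }

    /** Extends the non-injecting base: the field stays null. */
    static class BrokenCorrector extends PlainBase {
        @Param(name = "corpusFile")
        String corpusFile;
    }

    /** Extends the injecting base: the field receives the configured value. */
    static class FixedCorrector extends InjectingBase {
        @Param(name = "corpusFile")
        String corpusFile;
    }

    public static void main(String[] args) {
        Map<String, String> settings = Collections.singletonMap("corpusFile", "words.txt");
        BrokenCorrector broken = new BrokenCorrector();
        broken.initialize(settings);
        FixedCorrector fixed = new FixedCorrector();
        fixed.initialize(settings);
        System.out.println(broken.corpusFile); // prints null
        System.out.println(fixed.corpusFile);  // prints words.txt
    }
}
```

The two corrector classes declare the same annotated field; only the choice of base class decides whether it is populated.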
