dkpro-core

Tagset used by Swedish hunpos model needs additional processing.

Open. reckart opened this issue 10 years ago • 1 comment

The Swedish model for hunpos produces rich tags including morphological information,
e.g.:

"PN_NEU_SIN_DEF_SUB/OBJ", "VB_PRS_AKT", "DT_NEU_SIN_IND", "NN_NEU_SIN_IND_NOM", "DL_MAD"

The actual SUC tags are: PN, VB, DT, NN, MAD

Some kind of post-processing needs to be done for these tags, possibly by a class that
is shipped as part of the model rather than as part of the wrapper. This
would be a new concept.
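
As a rough illustration of the kind of post-processing meant here, the sketch below reduces the rich tags from the examples above to the plain SUC tags. It is not DKPro Core code: the function name `to_suc_tag` is hypothetical, and the rule for `DL` (treat it as a prefix and take the component after it) is an assumption based only on the single `DL_MAD` example.

```python
def to_suc_tag(rich_tag: str) -> str:
    """Reduce a rich hunpos tag to its coarse SUC tag (illustrative only)."""
    parts = rich_tag.split("_")
    # Assumption: "DL" is a prefix marking delimiters, and the actual
    # SUC tag (e.g. MAD) is the component that follows it.
    if parts[0] == "DL" and len(parts) > 1:
        return parts[1]
    # Otherwise the first component is the SUC tag and the rest is
    # morphological information.
    return parts[0]


rich_tags = [
    "PN_NEU_SIN_DEF_SUB/OBJ",
    "VB_PRS_AKT",
    "DT_NEU_SIN_IND",
    "NN_NEU_SIN_IND_NOM",
    "DL_MAD",
]
suc_tags = [to_suc_tag(tag) for tag in rich_tags]
print(suc_tags)
```

Whether such a rule should live in the wrapper or be shipped with the model is exactly the open question; the sketch only shows that the mapping itself could be small.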

Original issue reported on code.google.com by richard.eckart on 2014-01-12 19:06:41

reckart, May 12 '15 22:05

Here is the list of tags: http://spraakbanken.gu.se/korp/markup/msdtags.html

DL (probably "delimiter") is not in the list, so they seem to use a variant of the original list.

zesch, Oct 28 '15 09:10