dkpro-core
dkpro-core copied to clipboard
Tagset used by Swedish hunpos model needs additional processing.
The Swedish model for hunpos produces rich tags including morphological information,
e.g.:
"PN_NEU_SIN_DEF_SUB/OBJ", "VB_PRS_AKT", "DT_NEU_SIN_IND", "NN_NEU_SIN_IND_NOM", "DL_MAD"
The actual SUC tags are: PN, VB, DT, NN, MAD
Some kind of post-processing needs to be done for these tags. Possible a class which
is shipped as part of the model instead of being shipped as part of the wrapper. This
would be a new concept.
Original issue reported on code.google.com by richard.eckart on 2014-01-12 19:06:41
Here is the list of tags http://spraakbanken.gu.se/korp/markup/msdtags.html
DL (probably "delimiter") is not in the list, so they seem to use a variant of the original list.