stanza icon indicating copy to clipboard operation
stanza copied to clipboard

More annotations in the MISC column

Open rahonalab opened this issue 3 years ago • 1 comments

Similarly to the start_char and end_char information, it would be useful to write down annotations such as NER in the MISC column while calling convert_token_dict() Now the default is start_char/end_char, but maybe it would be better to have this as an option, something like

calling_convert_token_dict(misc)

where misc is a list of annotations such as ner, start_char/end_char or multi_ner

rahonalab avatar Apr 01 '22 10:04 rahonalab

The next version will have NER tags in the misc field of the conll docs

https://github.com/stanfordnlp/stanza/pull/996

AngledLuffa avatar Apr 08 '22 21:04 AngledLuffa