CoreNLP icon indicating copy to clipboard operation
CoreNLP copied to clipboard

How to get UD-POS tags in conllu dependency output?

Open nirmalsurange opened this issue 6 years ago • 1 comments

I am trying to run coreNLP for parsing some documents. How to get UD-POS tags along with fine-grained tags in output? I tried -outputFormat conllu, but the UPOS column remained empty. For example, I ran this command: "java -cp '../stanford-corenlp-full-2018-10-05/*' edu.stanford.nlp.pipeline.StanfordCoreNLP -file './samples/BA_wiki_00_id4397286_sample.txt' -outputFormat conllu -outputDirectory './samples/' " I want this: 1 Antonius Antonius NOUN NNP _ 2 compound _ _ that is 'NOUN NNP' both tags, but this is what I got:

1 Antonius Antonius _ NNP _ 2 compound _ _ 2 Romanus Romanus _ NNP _ 12 nsubj _ _ 3 ( ( _ -LRB- _ 4 punct _ _ 4 fl. fl. _ VBP _ 12 nsubj _ _ 5 1400 1400 _ CD _ 4 dep _ _ 6 -- -- _ : _ 7 punct _ _ 7 1432 1432 _ CD _ 5 dep _ _ 8 ) ) _ -RRB- _ 7 punct _ _ 9 was be _ VBD _ 12 cop _ _ 10 an a _ DT _ 12 det _ _ 11 Italian italian _ JJ _ 12 amod _ _ 12 composer composer _ NN _ 0 root _ _ 13 of of _ IN _ 17 case _ _ 14 the the _ DT _ 17 det _ _ 15 early early _ JJ _ 17 amod _ _ 16 15th 15th _ JJ _ 17 amod _ _ 17 century century _ NN _ 12 nmod _ _ 18 . . _ . _ 12 punct _ _

Please tell command-line solutions.

nirmalsurange avatar Nov 28 '19 17:11 nirmalsurange

The tagger models only set XPOS (even the ones named "UD" still set the XPOS)

AngledLuffa avatar Feb 14 '22 06:02 AngledLuffa