wordnet
wordnet copied to clipboard
Note of related work: Finnish WordNet adaptation for nltk's WordNet module
FinnWordNet has been incorporated into OMW of course, but firstly it's and old version, and secondly FinnWordNet is actually aligned with PWN at the lemma level rather than the synset level.
The problem is is that FinnWordNet adds other bits of markup into its WordNet index files.
Here it is: https://github.com/frankier/finntk/blob/master/finntk/wordnet/reader.py
One thing that might be nice here is to here -- assuming you have any interest -- is to make it so it's possible to do this kind of overriding with less copypasting. Another possibility would be to add FinnWordNet support directly to this repository.
I'm not really sure what's best here but I thought I'd ask.
Let me see what I can do in the code. Any documentation of what are the extra markup in Finn WN?