FrequencyWords icon indicating copy to clipboard operation
FrequencyWords copied to clipboard

Map words to word type

Open MaxValue opened this issue 8 years ago • 1 comments

Currently those words are just "WORD OCCURENCECOUNT".

I think it is highly useful for many individuals to have "WORD OCCURENCECOUNT TYPE", whereas TYPE specifies the word type. This word type should have the format convention used in natural language processing: NN = Noun, VB= Verb, JJ = Adjective, ...

I am in the process of doing this, the stanford tagger in combination with the nltk module seems to be the most usable one. Having installation troubles at the moment.

MaxValue avatar Aug 31 '17 15:08 MaxValue

@MaxValue The key would be to identify the type which isn't easy / reliable. good idea though

hermitdave avatar Oct 02 '17 16:10 hermitdave