sentiment icon indicating copy to clipboard operation
sentiment copied to clipboard

inflected words and adjectives in other languages.

Open pomarc opened this issue 5 years ago • 0 comments

This code to see if a token has a corresponding AFINN label: if (!labels.hasOwnProperty(obj)) continue;

in many non English language does not work for inflected words.

In English you say: "the cat is stupid" and "the cats are stupid" (btw cats are smart! :D )

In French you say: "le chat est stupide" and "les chats sont stupides". The stemmed word "stupid" is flected into stupide and stupides.

if we use "stupid" in the labels, the scoring is never done. Using all the possible inflected words is tedious and long.

may be you could change the line of code to take into account stemmed (or truncated) words?

pomarc avatar Sep 27 '18 11:09 pomarc