elasticsearch-analysis-baseform
elasticsearch-analysis-baseform copied to clipboard
English adjectives are not lemmatized
For example, "quickly" is not reduced to "quick."
It looks like there are lemma files for nouns and verbs, but not for adjectives. Is there a resource for english adjective lemmatization that could be added to the plugin?
Thanks very much.
Thanks for reporting, I will add adjectives.
Hi Jörg, just wondering if you have an ETA on adjectives.
Can you use https://github.com/jprante/elasticsearch-analysis-baseform/blob/master/src/main/resources/en-lemma-utf8.txt ? It contains nouns, verbs, and adjectives.
@jprante the link you gave is dead.
The file was moved to https://github.com/jprante/elasticsearch-analysis-baseform/blob/master/src/main/resources/baseform/en-lemma-utf8.txt