elasticsearch-analysis-baseform icon indicating copy to clipboard operation
elasticsearch-analysis-baseform copied to clipboard

English adjectives are not lemmatized

Open jenwachter opened this issue 11 years ago • 5 comments

For example, "quickly" is not reduced to "quick."

It looks like there are lemma files for nouns and verbs, but not for adjectives. Is there a resource for english adjective lemmatization that could be added to the plugin?

Thanks very much.

jenwachter avatar Nov 18 '13 13:11 jenwachter

Thanks for reporting, I will add adjectives.

jprante avatar Nov 18 '13 13:11 jprante

Hi Jörg, just wondering if you have an ETA on adjectives.

jenwachter avatar Apr 07 '14 13:04 jenwachter

Can you use https://github.com/jprante/elasticsearch-analysis-baseform/blob/master/src/main/resources/en-lemma-utf8.txt ? It contains nouns, verbs, and adjectives.

jprante avatar Apr 10 '14 21:04 jprante

@jprante the link you gave is dead.

jeacott avatar Aug 20 '14 00:08 jeacott

The file was moved to https://github.com/jprante/elasticsearch-analysis-baseform/blob/master/src/main/resources/baseform/en-lemma-utf8.txt

jprante avatar Aug 20 '14 07:08 jprante