simplemma icon indicating copy to clipboard operation
simplemma copied to clipboard

greedy decomposition not working on some german verbs

Open joprice opened this issue 2 years ago • 1 comments

I was testing the example of greedy decomposition for German, which works for the example angekündigten, but when using the word funktioniert, I get funktioniert out. Oddly though, both funktionierten and funktionierte return funktionieren. Testing again with angekündigt, it works as expected and returns the infinitive.

joprice avatar Jan 15 '24 11:01 joprice

Hi @joprice I was working on something else but I take good note of your example. By design the simplemma preserves the word forms rather than overgenerating lemmata.

As it cannot differentiate between word forms I often use the shorter form as a proxy: funktionierten is longer than funktionieren but not funktioniert. Besides, adjectives are often found in similar forms (-iert) although the length is probably the culprit here.

adbar avatar Mar 01 '24 16:03 adbar