FrequencyWords icon indicating copy to clipboard operation
FrequencyWords copied to clipboard

German words

Open DiplEng opened this issue 5 years ago • 4 comments

For german words it would be really beneficially if they could be written properly -> Nouns are written capitalized. So not "freund" but "Freund".

This would allow this list to be used for spellchecking.

DiplEng avatar Mar 12 '19 15:03 DiplEng

True, and also in German the pronouns "sie" and "Sie" have different meanings, so differentiating them in statistics could lead to much better results.

amadeomano avatar Jun 06 '19 07:06 amadeomano

In addition, there are a large number of words that should be spelled with an Umlaut "ä", "ö", "ü" but occur also in the list with the Umlaut replaced with "ae", "oe" and "ue".

This is an acceptable spelling only if the Umlaut is not available for some reason. These days we have Unicode and these spellings should not be considered correct under any circumstances.

felix-schneider avatar Sep 11 '19 16:09 felix-schneider

I'm also hoping this can be fixed

BrendanMartin avatar Jul 17 '20 17:07 BrendanMartin

Let me see if I can hack this bit in @felix-schneider - might have to do a find and replace. The issue with nouns @DiplEng and @amadeomano is knowing how to identify them programmatically. The data is just a sentence - most likely manually created

hermitdave avatar Aug 11 '20 13:08 hermitdave