german wordlist from dys2p

Open b068931cc450442b63f5b3d276ea4297 opened this issue 2 years ago • 2 comments

Since the usability of Diceware depends on the quality of the word list, a word list should consist of words that are as familiar and easy to remember as possible. The best-known German diceware word list, diceware_german.txt, does not meet these conditions. It contains special characters, numbers, letter sequences that are neither abbreviations nor words (e.g. zv, zw, zx, zy, zz, zzz and zzzz), and letter sequences that resemble words but are not words of the German language.

Our word list de-7776 is suitable as a diceware word list for five dice (6^5 = 7776 entries). The words are unique from the fifth letter on. It also follows these rules for the most part, though not one hundred percent:

  • Words are three to twelve characters long.
  • No word contains the characters ä, ö, ü or ß.
  • If possible, only familiar nouns, verbs and adjectives should be included, and in their basic form (nouns in the singular, verbs in the infinitive, adjectives in their uninflected form).
  • No proper names, regions, religions, associations, or persons.
  • No words with particularly negative connotations.
  • The "masculine" grammatical gender is preferred. (This is standard for BIP39.)
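The mechanically checkable rules above (length and forbidden characters) can be sketched as a small script, together with the mapping from five dice rolls to a list index. This is only an illustrative sketch, not the tooling used for de-7776: the function names `rule_violations` and `roll_to_index` are hypothetical, and the prefix check assumes one possible reading of "unique from the fifth letter on", namely that no two words share the same first four letters. Rules about familiarity, word class, and connotation still need manual review.

```python
FORBIDDEN = set("äöüß")  # characters the list avoids


def rule_violations(words, prefix_len=4):
    """Return (word, reason) pairs for the mechanically checkable rules.

    prefix_len=4 is an assumption about what "unique from the fifth
    letter on" means; adjust it if the intended rule differs.
    """
    problems = []
    seen_prefixes = {}
    for word in words:
        if not 3 <= len(word) <= 12:
            problems.append((word, "length not in 3..12"))
        if FORBIDDEN & set(word.lower()):
            problems.append((word, "contains ä, ö, ü or ß"))
        prefix = word[:prefix_len].lower()
        if prefix in seen_prefixes:
            problems.append((word, f"shares prefix with {seen_prefixes[prefix]}"))
        else:
            seen_prefixes[prefix] = word
    return problems


def roll_to_index(rolls):
    """Map five dice rolls (each 1..6) to a 0-based index in 0..7775."""
    if len(rolls) != 5 or any(r not in range(1, 7) for r in rolls):
        raise ValueError("expected exactly five rolls, each between 1 and 6")
    index = 0
    for r in rolls:
        index = index * 6 + (r - 1)  # treat the rolls as a base-6 number
    return index
```

A list with exactly 7776 entries and an empty `rule_violations` result would satisfy the checkable part of the rules; `roll_to_index` then selects one word per set of five rolls.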

The list definitely looks good and would be a vast improvement over the default German list in the absence of #59. Are the words hand-picked, or does the list follow a methodology similar to the DeReKo list? Skimming your list, I found multiple verbs in both the infinitive and the third-person singular. Spelled-out numbers are present, while e.g. colors are missing.

pomeloy, Mar 08 '23 17:03

Thank you very much. I am sorry, you are right. The list is hand-picked, and I hope to revise it in the coming weeks.