diceware icon indicating copy to clipboard operation
diceware copied to clipboard

replaces wordlist-6-dice with a brand new word list

Open sts10 opened this issue 2 years ago • 1 comments
trafficstars

Thought I'd take a stab at creating a new 46,656-word list for use in this project. (In a way this works toward the goals of #4.)

The words are from my own scrape of 2012 Google Ngram data and Niceware v. 4.0 word list. I created it using a tool I made called Tidy.

Some attributes describing the new word list:

List length               : 46656 words
Mean word length          : 8.04 characters
Length of shortest word   : 4 characters (able)
Length of longest word    : 12 characters (youthfulness)
Entropy per word          : 15.510 bits

And here are some generated passphrases from the list:

surgeons revivalist whether waders impulsively flippant 
humanely gallon compile integers aught outpouring 
revered enfranchise stalactites spooking daintier wreak 
cribbed slaved gratitude narrowed collection materialists 
fibula diameter nutritional furniture gullies surmise 

Note that, in order to include a wider range of commonly used words, I've allowed words as short as 4 characters and as long as 12 into the list. I realize that that may disqualify this list out-right, but I thought I'd submit this PR anyway.

sts10 avatar Jan 16 '23 23:01 sts10

Thanks! I see the mean word length is 8 characters, so that looks pretty reasonable.

I have some travel coming up which will go through next weekend, so I expect to be able to look through this and try implementing it in a couple of weeks.

-- Doug

dmuth avatar Jan 16 '23 23:01 dmuth