Johnny
Johnny
Nice work but from 350000+ lines only around 2500 survived? Seems like the parameters used have been a little too strict...
``~30 000`` would be [closer to reality](https://englishlive.ef.com/blog/language-lab/many-words-english-language/) but it appears to have duplicated a bunch of words as well which were not duplicated on the original ``words_alpha.txt``. See bedrock. bedroll,...
> The API is free for 2500 words per day. That is probably why.... @Orivoir did get `~30 000` words just by using different parameters, so that was probably not...
@SDidge At first glance I can't seem to find any non-english words on the file so I'd say this one is the cleanest file so far, nice work!