english-words icon indicating copy to clipboard operation
english-words copied to clipboard

Y isn't sorted properly

Open lotsofs opened this issue 4 years ago • 2 comments

Some words with the letter Y appear under the letter I instead.

Examples: NYET appears in between NIESHOUT and NIETZSCHE

ABIRRITATIVE is followed by ABYS ABYSM ABYSMAL [ etc ] before continuing with ABYSSUS ABISTON ABIT

XXV -> XXX -> Z -> ZA -> ZABAEAN

Words like YACHT, YACCA, YARTH etc all appear under I

Did you perhaps use the Frisian alphabet when sorting the list?

lotsofs avatar May 29 '20 15:05 lotsofs

if you're in javascript you can use .sort to sort them alphabetically Hope this helps

RUGMJ avatar Dec 06 '20 14:12 RUGMJ

if you're in javascript you can use .sort to sort them alphabetically

True, but even the shorter 370k-word "alpha" version can take about 30 seconds to sort, so you wouldn't want to do it at runtime, you'd want to save the result as a new file - and make it available here.

So that's what I'll do! List.Sort() and System.IO.File.WriteAllLines() provided this result in UTF-8 (same as original words_alpha.txt, I believe) with what looks to me like the same CRLF line endings. I had to manually delete one extraneous CRLF at the end of the file but otherwise this is just as C# outputted it.

words_alpha_sorted.txt

JayArrrgh avatar Feb 01 '21 20:02 JayArrrgh