Request: suggest where to get UnicodeData.txt
The README says UnicodeData.txt is required and suggests the URL http://www.unicode.org/Public/. That URL presents me with 44 directories and no indication which of them might contain a file named UnicodeData.txt. I can understand that versions change and you might not want to link to a specific version, but at least could you give a hint as to which directory people should be looking at? There seems to be a file with that name in UNIDATA/ so I'll start with that, but since you didn't link directly there I'm wondering if there might be other versions in other directories (that file does seem to work).
This unicode module looks very useful, thanks.
That's a good point - the directory surely changed a lot (becoming much more complex) since the first version of the unicode utility... As the first step, in 44533629f7b49a299b705fe02bdb58d58120de3a I implemented --download switch to download it automatically (the version is hardwired for the moment).