unidecode icon indicating copy to clipboard operation
unidecode copied to clipboard

I wanna "香港" to be "Hong Kong", not the Fxxking "Xiang Gang"

Open dukelec opened this issue 11 years ago • 2 comments

Why chinese character always coded as Pinyin? Pinyin only used in Mandarin (PuTongHua), Chinese people in Kwangtung, HongKong and Macao using Cantonese, not Mandarin!

dukelec avatar Nov 22 '14 04:11 dukelec

Oops, never used Cantonese, or Mandarin. Can you please open PR with the proposed change?

iki avatar Nov 24 '14 12:11 iki

To support this behavior, it need to be aware with preferred language. Unidecode has a single mapping table and it is different with the demand.

I've developed an enhanced version of unidecode, named unihandecode that has a basic capability for it. https://github.com/miurahr/unihandecode

You ca see an experimental support for Cantonese in master branch(look at test code). It uses the Unicode standard Unihan reading definitions for dictionary. This may help you.

miurahr avatar Jun 08 '15 23:06 miurahr