List the Chinese characters in Unicode?

Open • xfq opened this issue 1 year ago • 3 comments

It might be useful to list the Chinese characters in Unicode, as klreq and alreq do:

  • The basic set (U+4E00-U+9FA5), i.e., ISO/IEC 10646:1993
  • CJK Unified Ideographs Extension A, i.e., U+3400-U+4DB5 in ISO/IEC 10646:1999
  • U+3400-U+9FFF (BMP Chinese characters)
  • U+20000-U+2FFFF, i.e., CJK Unified Ideographs Extension B to Extension F (plus Extension I, added in September 2023), commonly known as the Supplementary Ideographic Plane (SIP)
  • U+30000-U+3FFFF, i.e., CJK Unified Ideographs Extension G to Extension H, commonly known as the Tertiary Ideographic Plane (TIP)
  • CJK Compatibility Ideographs in the Basic Multilingual Plane (U+F900-U+FAFF)
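
As an illustration only, the ranges above can be turned into a small lookup. This is a minimal sketch, not part of clreq: the names `HAN_RANGES` and `classify_han` are hypothetical, the labels are informal, and the ranges are taken exactly as listed, so they also cover unassigned code points.

```python
from typing import Optional

# Ranges copied from the list above. Labels are informal (assumed for this
# sketch), not official Unicode block names.
HAN_RANGES = [
    (0x3400, 0x9FFF, "BMP Chinese characters"),
    (0xF900, 0xFAFF, "CJK Compatibility Ideographs (BMP)"),
    (0x20000, 0x2FFFF, "Supplementary Ideographic Plane (SIP)"),
    (0x30000, 0x3FFFF, "Tertiary Ideographic Plane (TIP)"),
]


def classify_han(ch: str) -> Optional[str]:
    """Return the label of the listed range containing ch, or None."""
    cp = ord(ch)
    for start, end, label in HAN_RANGES:
        if start <= cp <= end:
            return label
    return None


if __name__ == "__main__":
    for sample in ["中", "\U00020000", "\uF900", "A"]:
        print(f"U+{ord(sample):04X}", classify_han(sample))
```

A plain range check like this says nothing about whether a code point is actually assigned or is an ideograph; it only mirrors the proposed list.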

xfq • Feb 13 '24 09:02

Should CJK Compatibility Ideographs be abandoned?

yisibl • Apr 18 '24 11:04

> Should CJK Compatibility Ideographs be abandoned?

There seem to be some standard Chinese characters in CJK Compatibility Ideographs. @eisoch?

xfq • Apr 21 '24 04:04

U+3007 (〇) IDEOGRAPHIC NUMBER ZERO in CJK Symbols and Punctuation (U+3000..U+303F) is also considered a hanzi by standards, dictionaries, and the UCS, according to 「〇」算不算汉字? - 知乎 (Is “〇” a hanzi? - Zhihu).

Additionally, outside the list @xfq provided above, there are some other characters with the Script property value “Han” in the UCD, such as U+3005 (々) IDEOGRAPHIC ITERATION MARK and the Suzhou numerals (U+3021..U+3029). Should they be listed?
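
For reference, here is a rough sketch that prints the name and general category of the characters mentioned in this comment. It assumes Python's standard `unicodedata` module, which exposes names and general categories but not the Script property, so it does not itself confirm the “Han” value; that would need the UCD's Scripts.txt or a third-party library. The names `OUTLIERS` and `describe` are hypothetical.

```python
import unicodedata

# Characters named in this comment that sit outside the ranges listed earlier.
OUTLIERS = ["\u3007", "\u3005"] + [chr(cp) for cp in range(0x3021, 0x302A)]


def describe(ch: str) -> tuple:
    """Return (code point label, general category, character name) for ch."""
    return (f"U+{ord(ch):04X}", unicodedata.category(ch), unicodedata.name(ch))


if __name__ == "__main__":
    for ch in OUTLIERS:
        # Note: the Suzhou numerals carry the formal name "HANGZHOU NUMERAL ..."
        print(*describe(ch))
```

None of these has a letter-like "Lo" general category (U+3007 and the numerals are "Nl", U+3005 is "Lm"), which is one reason a list based purely on ideograph blocks would miss them.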

AmeroHan • Aug 31 '24 08:08