irg icon indicating copy to clipboard operation
irg copied to clipboard

Traditional and Simplified pairs missing from Unihan DB

Open eisoch opened this issue 5 years ago • 8 comments

T/UCS Trad. S/UCS Simp. Notes
U+3823 U+2BD77 𫵷
U+44E3 U+2C72F 𬜯
U+587F U+2A8FB 𪣻
U+74DB U+24A7D 𤩽
U+894C U+891D G glyph been modified
U+8A7C U+8BD9
U+8F0B U+2AA36 𪨶 different rad.
U+9249 U+94C9
U+9265 U+2CB38 𬬸
U+289C0 𨧀 U+2CB4A 𬭊
U+28A0F 𨨏 U+2CB5B 𬭛
U+28B46 𨭆 U+2CB76 𬭶

eisoch avatar Mar 04 '19 06:03 eisoch

Some T/S pairs have not been listed in the Unihan DB. They are collected here and maybe I will submit a document on this issue in future.

eisoch avatar Mar 04 '19 06:03 eisoch

可以參考這裡 https://github.com/hfhchan/irg/blob/master/kVariants.txt#L603

(獻献 這裡沒有視為繁簡關係

hfhchan avatar Mar 04 '19 06:03 hfhchan

@eisoch Maybe you can email Unihan list so John can update these? It will save time for you and UTC.

@hfhchan 铉 (U+94C9) and 鉉 (U+9249) are mismarked in this repo.

wtn avatar Jul 21 '19 19:07 wtn

Fixed 94c9 on glyphwiki, will sync list later

hfhchan avatar Jul 21 '19 20:07 hfhchan

Another (mismarked) one is 诙 (U+8BD9) and 詼 (U+8A7C).

wtn avatar Jul 21 '19 23:07 wtn

@wtn Had been added into the list. This list is not complete, so I will wait for more pairs in future.

eisoch avatar Jul 22 '19 01:07 eisoch

Andrew just submitted a big set on the Unihan list. Everything in Eiso's table above is either in Andrew's list, or is already in Unihan (from Unicode 12).

wtn avatar Oct 22 '19 21:10 wtn

Unicode 13.0 includes all of these pairs.

wtn avatar Mar 15 '20 17:03 wtn