libchewing
chewing-cli should also have a Bopomofo (Zhuyin) correction feature
Discussed in https://github.com/chewing/libchewing/discussions/654
Originally posted by llc0930 November 5, 2024
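chewing-cli should also have a Bopomofo (Zhuyin) correction feature.
The Han character 一 (U+4E00) should be corrected to the Bopomofo symbol ㄧ (U+3127); the Han character 丫 (U+4E2B) should be corrected to ㄚ (U+311A).
Especially the former... the Ministry of Education's xls and xlsx files are a real nightmare.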
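For illustration, here is a minimal Python sketch of the kind of correction being requested. It is not part of chewing-cli; the names `LOOKALIKES`, `fix_reading`, and `fix_line` are hypothetical, and the line layout (phrase, frequency, then space-separated syllables) is an assumption about the input file. The substitution is applied only to the reading fields, so a legitimate 一 inside the phrase itself is left alone.

```python
# Hypothetical helper, not part of chewing-cli.
# Maps Han characters that are easily mistaken for Bopomofo to the real symbols.
LOOKALIKES = str.maketrans({
    "\u4e00": "\u3127",  # 一 (CJK ideograph "one") -> ㄧ (Bopomofo letter I)
    "\u4e2b": "\u311a",  # 丫 (CJK ideograph)       -> ㄚ (Bopomofo letter A)
})


def fix_reading(reading: str) -> str:
    """Replace look-alike Han characters inside a Bopomofo reading."""
    return reading.translate(LOOKALIKES)


def fix_line(line: str) -> str:
    """Fix only the reading fields of one source line.

    Assumed layout: phrase, frequency, then one field per syllable,
    all separated by whitespace. Lines with fewer than three fields
    are returned unchanged.
    """
    fields = line.split()
    if len(fields) < 3:
        return line
    head, syllables = fields[:2], fields[2:]
    return " ".join(head + [fix_reading(s) for s in syllables])
```

Running every line of the source text through `fix_line` before calling `chewing-cli init-database` would be one way to work around the problem until the tool handles it itself.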
chewing-cli evidently drops duplicate phrases when it generates a dictionary, so I did not deduplicate the input... 《成語典》dict_idioms_2020_20240926.txt
```
chewing-cli init-database -n "《成語典》" -c "中華民國教育部" -l "CC BY-ND 3.0 臺灣" -r "2020_20240926" ./《成語典》dict_idioms_2020_20240926.txt dict_idioms_20240926.dat
== Trie Dictionary Statistics ==
Node count : 19226
Leaf count : 5186
Phrase count : 5456
Max height : 9
Average height : 1
Root branch count : 705
Max branch count : 80
Average branch count : 0
```
Other simple corrections: https://github.com/chewing/libchewing/discussions/656#discussion-7450782