hanzi-tools
Does segment support splitting Traditional Chinese into words?
As in the title.
Or do I need to convert the text to Simplified Chinese first, and then convert it back?
It probably won't work very well for traditional characters because the segmentation library used (jieba) is trained on simplified texts. For now you'll probably have to convert to simplified first.
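A rough sketch of that workaround, in case it helps. Rather than converting the segmented words back to traditional (which can be lossy, since one simplified character may map to several traditional ones), it cuts the original traditional string at the same positions the segmenter chose. It assumes simplification is character-for-character, which may not hold for every converter, so the helper checks the length and throws otherwise. `segmentTraditional`, `toySimplify`, and `toySegment` are hypothetical names made up for this example; the two toy functions are stand-ins for whatever real simplifier and segmenter you pass in.

```javascript
// Segment Traditional Chinese by way of a Simplified copy, then map the
// word boundaries back onto the original text.
// `simplify` and `segment` are injected; nothing here assumes a library API.
function segmentTraditional(text, simplify, segment) {
  const original = Array.from(text); // split by code point, not UTF-16 unit
  const simplified = simplify(text);
  if (Array.from(simplified).length !== original.length) {
    throw new Error('simplify() changed the character count; cannot align');
  }
  const words = [];
  let pos = 0;
  for (const word of segment(simplified)) {
    const len = Array.from(word).length;
    // Take the same span from the original traditional text.
    words.push(original.slice(pos, pos + len).join(''));
    pos += len;
  }
  if (pos !== original.length) {
    throw new Error('segments do not cover the whole input');
  }
  return words;
}

// Toy stand-ins so the sketch runs on its own (NOT real implementations).
const toyMap = { '學': '学', '習': '习', '漢': '汉', '語': '语' };
const toySimplify = (s) => Array.from(s, (c) => toyMap[c] || c).join('');

const toyDict = new Set(['我', '学习', '汉语']);
function toySegment(s) {
  // Greedy longest match against a tiny dictionary, max word length 2.
  const chars = Array.from(s);
  const out = [];
  let i = 0;
  while (i < chars.length) {
    let len = Math.min(2, chars.length - i);
    while (len > 1 && !toyDict.has(chars.slice(i, i + len).join(''))) len--;
    out.push(chars.slice(i, i + len).join(''));
    i += len;
  }
  return out;
}

const result = segmentTraditional('我學習漢語', toySimplify, toySegment);
console.log(result); // the words 我, 學習, 漢語
```

With a real simplifier and segmenter swapped in for the toy functions, the output stays in traditional characters and no back-conversion step is needed.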