HanziToAnki
Do better segmentation
We should take inspiration from https://github.com/fxsjy/jieba, either by finding a Java library that does this or by writing our own implementation. My understanding is that it works by building a DAG of all the possible ways to segment a sentence or clause, then using word frequencies to calculate a probability for each candidate segmentation and picking the most likely one.
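
To make the idea concrete, here is a rough sketch of how our own implementation could look. It is not jieba's code and only reflects my understanding above: every position in the text is a node, every dictionary word starting there is an edge, and dynamic programming picks the path with the highest total log-probability. The class name `DagSegmenter`, the toy frequency map, and the fallback count of 1 for unknown single characters are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Illustrative DAG + word-frequency segmenter (hypothetical; not jieba's actual code). */
class DagSegmenter {
    private final Map<String, Integer> freq; // word -> count, e.g. loaded from a dictionary file
    private final double logTotal;           // log of the sum of all counts
    private final int maxWordLen;            // longest dictionary word, bounds the inner loop

    DagSegmenter(Map<String, Integer> freq) {
        this.freq = freq;
        long total = 0;
        int maxLen = 1;
        for (Map.Entry<String, Integer> e : freq.entrySet()) {
            total += e.getValue();
            maxLen = Math.max(maxLen, e.getKey().length());
        }
        this.logTotal = Math.log(Math.max(total, 1));
        this.maxWordLen = maxLen;
    }

    /** Log-probability of a word; words missing from the dictionary get an assumed count of 1. */
    private double logProb(String word) {
        return Math.log(freq.getOrDefault(word, 1)) - logTotal;
    }

    /** Returns the most probable segmentation of text under the unigram frequency model. */
    List<String> segment(String text) {
        // Note: indexes by UTF-16 char, which is fine for BMP hanzi but not for rarer
        // supplementary-plane characters.
        int n = text.length();
        double[] best = new double[n + 1]; // best[i] = best log-prob for text[i..n); best[n] = 0
        int[] next = new int[n + 1];       // next[i] = end index of the first word on that best path

        // Walk right to left so best[j] is already known for every edge i -> j.
        for (int i = n - 1; i >= 0; i--) {
            best[i] = Double.NEGATIVE_INFINITY;
            int limit = Math.min(n, i + maxWordLen);
            for (int j = i + 1; j <= limit; j++) {
                String word = text.substring(i, j);
                // An edge i -> j exists for dictionary words, plus single characters
                // so that a path through the DAG always exists.
                if (j == i + 1 || freq.containsKey(word)) {
                    double score = logProb(word) + best[j];
                    if (score > best[i]) {
                        best[i] = score;
                        next[i] = j;
                    }
                }
            }
        }

        // Follow the recorded edges from the start to recover the words.
        List<String> words = new ArrayList<>();
        for (int i = 0; i < n; i = next[i]) {
            words.add(text.substring(i, next[i]));
        }
        return words;
    }
}
```

With a toy dictionary of `研究` (100), `研究生` (10), `生命` (80) and `命` (2), `new DagSegmenter(dict).segment("研究生命")` returns `[研究, 生命]` rather than `[研究生, 命]`, because the product of the two frequent words' probabilities is higher. A real version would need an actual frequency list and some handling for out-of-vocabulary words, which this sketch does not attempt.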