HanziToAnki icon indicating copy to clipboard operation
HanziToAnki copied to clipboard

Do better segmentation

Open danielt998 opened this issue 1 year ago • 1 comments
trafficstars

We should take inspiration from https://github.com/fxsjy/jieba either finding a Java a library to do so or producing our own implementation. My understanding is that it works by producing a DAG and looking at all the possible ways of segmenting a sentence/clause and using word frequency to calculate a probability.

danielt998 avatar Sep 10 '24 22:09 danielt998