g2pm icon indicating copy to clipboard operation
g2pm copied to clipboard

论文示例里的数据输出错误

Open 980202006 opened this issue 4 years ago • 5 comments

model('今天来的目的是什么?') model = G2pM() model('今天来的目的是什么?') output:['jin1', 'tian1', 'lai2', 'de5', 'mu4', 'de5', 'shi4', 'shen2', 'me5', '?'] 是否是安装问题。 g2pm 版本为0.1.2.4

980202006 avatar Sep 28 '20 06:09 980202006

i have same issue

wac81 avatar Oct 19 '20 07:10 wac81

Sorry, I can not understand Chinese. Could you tell me what the issue is in English? Goolgle translation says the output of our model is different from the one described in the paper.

The example of grapheme to phoneme conversion described in the paper is not the output of g2pm. It shows what is the golden pinyin for the given sentence to describe what the grapheme to phoneme is.

As shown in the experiments, our model is not perfect. It makes several errors for disambiguating polyphonic characters.

seanie12 avatar Oct 19 '20 10:10 seanie12

yeah,But for common polyphonic words, the output is often questionable, whether or not the installation package is a problem

980202006 avatar Oct 20 '20 02:10 980202006

The model even makes mistakes for common polyphonic words. g2pm is very simple model for polyphone disambiguation. There is still large room for improvement of g2pm.

seanie12 avatar Oct 20 '20 04:10 seanie12

but your paper output is model = G2pM() model('今天来的目的是什么?') output:['jin1', 'tian1', 'lai2', 'de5', 'mu4', 'di4', 'shi4', 'shen2', 'me5', '?']

wac81 avatar Oct 26 '20 02:10 wac81