g2pm
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Hello, I have trained the BERT model according to your code. How can I use the trained BERT model for pinyin annotation? :)
I made a pull request.
According to https://pytorch.org/docs/1.7.1/generated/torch.nn.LSTM.html?highlight=lstm#torch.nn.LSTM, `i_t` and `f_t` are calculated incorrectly.
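For reference when checking the gate computation, here is a minimal pure-Python sketch of one LSTM cell step that follows the equations and the stacked gate order (input, forget, cell, output) given in the linked `torch.nn.LSTM` documentation. The function names `lstm_cell_step`, `sigmoid`, and `matvec` are hypothetical helpers for illustration; this is not the package's own code.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector (list of floats).
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def lstm_cell_step(x, h_prev, c_prev, W_ih, W_hh, b_ih, b_hh):
    """One LSTM step. Weights are stacked as [i | f | g | o], so W_ih has
    4 * hidden_size rows and W_hh has 4 * hidden_size rows."""
    H = len(h_prev)
    # Pre-activations for all four gates: W_ih x + b_ih + W_hh h_prev + b_hh.
    z = [a + b + c + d for a, b, c, d in
         zip(matvec(W_ih, x), b_ih, matvec(W_hh, h_prev), b_hh)]
    i = [sigmoid(v) for v in z[0:H]]            # input gate i_t
    f = [sigmoid(v) for v in z[H:2 * H]]        # forget gate f_t
    g = [math.tanh(v) for v in z[2 * H:3 * H]]  # candidate cell state g_t
    o = [sigmoid(v) for v in z[3 * H:4 * H]]    # output gate o_t
    # c_t = f_t * c_{t-1} + i_t * g_t ; h_t = o_t * tanh(c_t)
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c
```

Comparing the slices used for `i` and `f` against a hand-written implementation is a quick way to spot swapped or mis-sliced gates.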
1. The SOS (BOS) and EOS tokens have no effect on the results and can be removed. 2. There are some label errors in the data ('儿' r5->er5, '樘' cheng3->cheng1, '骑' ji4->qi2). And after excluding monosyllabic words, the actual...
Hi, guys! I tested some common Mandarin Chinese texts. The g2pM model gets all of them wrong, while pypinyin gets all of them right. Here are the examples I tested.
How do I use new data?
About the test results: which Chinese BERT model did you use, and from which repo?
`model = G2pM() model('今天来的目的是什么?') output:['jin1', 'tian1', 'lai2', 'de5', 'mu4', 'de5', 'shi4', 'shen2', 'me5', '?']` Is this an installation problem? The g2pm version is 0.1.2.4.
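To make a report like this easier to read, the output can be compared position by position against a reference reading. The sketch below is pure Python and does not call the package. `diff_readings` is a hypothetical helper, and the `expected` list is an assumption based on the standard reading of 目的 as mu4 di4, not something stated in the report.

```python
def diff_readings(chars, predicted, expected):
    """Return (index, char, predicted, expected) wherever the readings differ."""
    if not (len(chars) == len(predicted) == len(expected)):
        raise ValueError("chars, predicted and expected must have equal length")
    return [(k, ch, p, e)
            for k, (ch, p, e) in enumerate(zip(chars, predicted, expected))
            if p != e]


sentence = "今天来的目的是什么?"
# Output quoted in the report above.
predicted = ['jin1', 'tian1', 'lai2', 'de5', 'mu4', 'de5',
             'shi4', 'shen2', 'me5', '?']
# Assumed reference reading (目的 read as mu4 di4); not taken from the report.
expected = ['jin1', 'tian1', 'lai2', 'de5', 'mu4', 'di4',
            'shi4', 'shen2', 'me5', '?']

mismatches = diff_readings(list(sentence), predicted, expected)
for index, char, got, want in mismatches:
    print(f"position {index}: {char} predicted {got}, expected {want}")
```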
Hi, thanks for the good work. I used Chinese BERT for this task with your dataset, but I couldn't get results as good as yours. So I want to study...