g2p-seq2seq

Dealing with words with multiple pronunciations

Open ivancarapinha opened this issue 4 years ago • 2 comments

Hello,

Since this g2p transformer performs phonetic transcription word by word, how does it select the correct pronunciation for a word that has several possible pronunciations? This is very common for many nouns and verbs, for example, the noun "content" and the verb "to content" (to satisfy).

Thank you

ivancarapinha Jul 06 '20 15:07

It supports n-best output in theory. Using part of speech as an input feature for training is also possible, but it requires work on the model architecture and, correspondingly, on the code.

nshmyrev Jul 06 '20 20:07
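
One way to combine the two ideas above without touching the model is to request n-best output and then choose among the candidates with an external part-of-speech tag. The following is only a minimal sketch under stated assumptions: the `HETERONYMS` table, the `pick_pronunciation` helper, and the ARPAbet strings are hypothetical illustrations and are not part of g2p-seq2seq. The n-best list is assumed to arrive already decoded as plain strings, and the POS tag is assumed to come from a separate tagger.

```python
from typing import Dict, List, Tuple

# Hypothetical heteronym table (not part of g2p-seq2seq):
# (lowercased word, coarse POS tag) -> preferred ARPAbet pronunciation.
HETERONYMS: Dict[Tuple[str, str], str] = {
    ("content", "NOUN"): "K AA1 N T EH0 N T",
    ("content", "VERB"): "K AH0 N T EH1 N T",
}


def pick_pronunciation(word: str, pos: str, nbest: List[str]) -> str:
    """Choose one pronunciation from an n-best list using a POS tag.

    Falls back to the top-ranked candidate when the word is not a known
    heteronym or the preferred variant is missing from the n-best list.
    """
    if not nbest:
        raise ValueError("nbest must contain at least one candidate")
    preferred = HETERONYMS.get((word.lower(), pos))
    if preferred is not None and preferred in nbest:
        return preferred
    return nbest[0]


# Assumed n-best output for "content", best candidate first.
candidates = ["K AA1 N T EH0 N T", "K AH0 N T EH1 N T"]
noun_form = pick_pronunciation("content", "NOUN", candidates)
verb_form = pick_pronunciation("content", "VERB", candidates)
```

This keeps the word-by-word model unchanged and moves disambiguation into a lookup step, at the cost of maintaining the heteronym table by hand.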

Does that mean that, as of now, the input dictionary used to train the g2p model should contain only the 1-best pronunciation for each word? If not, how should multiple pronunciations be handled in the training dictionary?

widdiot Jul 01 '21 16:07