
Feature request - `<phoneme>` support for SSML

Open lagleki opened this issue 2 years ago • 2 comments

🚀 Feature

Allow phonetic pronunciation for necessary words

Motivation

Sometimes it's necessary to customize the pronunciation of words with non-standard spelling or words borrowed from other languages. In that case, being able to supply a transcription in IPA or X-SAMPA would be nice (see e.g. Amazon Polly for an explanation of the syntax).

Pitch

Wrapping an IPA or X-SAMPA transcription in a `<phoneme>` tag makes the engine pronounce the word according to that transcription.
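For illustration, here is a minimal sketch of what the requested markup could look like, built in Python. The `alphabet` and `ph` attributes follow the SSML `<phoneme>` element as used by Polly; the `phoneme` helper and the IPA string for "segue" are assumptions made for this example, and silero-models does not currently interpret this tag.

```python
from xml.sax.saxutils import escape, quoteattr


def phoneme(word: str, ph: str, alphabet: str = "ipa") -> str:
    """Wrap `word` in an SSML <phoneme> element carrying its transcription.

    Hypothetical helper: it only builds the markup string and does not
    call any TTS engine.
    """
    # quoteattr() adds the surrounding quotes and escapes the attribute value.
    return (
        f"<phoneme alphabet={quoteattr(alphabet)} ph={quoteattr(ph)}>"
        f"{escape(word)}</phoneme>"
    )


# Illustrative IPA transcription; adjust to the pronunciation you want.
ssml = f"<speak>Let me {phoneme('segue', 'ˈsɛɡweɪ')} to the next topic.</speak>"
print(ssml)
```

An engine that supports the tag would read the `ph` value instead of guessing from the spelling; the wrapped text stays as a fallback for engines that ignore it.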

Alternatives

Not sure if there are any within the project. Using other projects that support `<phoneme>` is possible.

Additional context

lagleki · Apr 13 '22 16:04

This is a nice feature to have, but probably in the semi-distant future.

snakers4 · Apr 13 '22 17:04

This would be really useful, especially because the model mispronounces a lot of words, such as (pronunciation in parentheses): segue, one-time, tap-in, soccer (such-er), one-on-one, lineup, deviates, Thomas (Thumb as), diving (dee-ving), AI (ey), rewind (re-wind like the air), Danish (Dar-nish), mishap (me shap), mishit (me shit), and a lot more.

I'm getting by for now by replacing these words with alternative spellings, but it's not ideal, and it's not easy.
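A minimal sketch of that respelling workaround, assuming a hand-maintained lookup table applied to the text before it is sent to the model. The `RESPELLINGS` entries and the `respell` helper are hypothetical; the spellings have to be tuned by ear for whichever voice is in use.

```python
import re

# Hypothetical respellings; the right-hand sides are guesses to tune by ear.
RESPELLINGS = {
    "segue": "segway",
    "soccer": "socker",
    "mishap": "miss-hap",
}

# Match whole words only, longest keys first so longer entries win.
_PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(w) for w in sorted(RESPELLINGS, key=len, reverse=True))
    + r")\b",
    re.IGNORECASE,
)
_LOOKUP = {key.lower(): value for key, value in RESPELLINGS.items()}


def respell(text: str) -> str:
    """Replace known problem words with spellings the model reads better."""

    def _swap(match: re.Match) -> str:
        original = match.group(0)
        replacement = _LOOKUP[original.lower()]
        # Keep a leading capital so sentence starts still look right.
        if original[0].isupper():
            replacement = replacement[0].upper() + replacement[1:]
        return replacement

    return _PATTERN.sub(_swap, text)


print(respell("Soccer fans segue home after a mishap."))
```

This only matches exact word forms (so "mishaps" is left alone), which is part of why the approach does not scale the way a `<phoneme>` tag would.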

MulleDK19 · Nov 18 '23 13:11