vosk-api icon indicating copy to clipboard operation
vosk-api copied to clipboard

where to get the dictionary of vosk-model-en-us-0.22-lgraph

Open YangangCao opened this issue 2 months ago • 8 comments

Hi, dear author, I want to get the dictionary of vosk-model-en-us-0.22-lgraph to check every phone in a word, where can I get? Thanks very much

YangangCao avatar Apr 08 '24 16:04 YangangCao

It is inside the compilation package

https://alphacephei.com/vosk/models/vosk-model-en-us-0.22-compile.zip

nshmyrev avatar Apr 08 '24 16:04 nshmyrev

ok got it, thanks for your quick and accurate reply!

YangangCao avatar Apr 08 '24 16:04 YangangCao

Hi, sorry to bother you, I find some phone which is hard to read, for example electromagnetic electromagnetic @_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I 4_I I_I k_E electromagnetic electromagnetic @_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I t_I I_I k_E electromagnetic electromagnetic I_B l_I E_I k_I 4_I r_I oU_I m_I {_I g_I n_I E_I 4_I I_I k_E electromagnetic electromagnetic I_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I t_I I_I k_E

what's 4_I ? And the word "electromagnetic" have 4 kinds of phone arrangements, I want to calculate GOP(goodness of pronunciation), how to decide the only one phone arrangement?

YangangCao avatar Apr 08 '24 16:04 YangangCao

what's 4_I ?

4 is a sampa phone something like "ch".

_I is for word internal, you are probably looking inside intermediate lexicon instead of original one.

I want to calculate GOP(goodness of pronunciation), how to decide the only one phone arrangement?

run alignment

nshmyrev avatar Apr 09 '24 02:04 nshmyrev

thanks for your reply, it help me a lot

YangangCao avatar Apr 09 '24 06:04 YangangCao

Hi dear author, why the vosk use different phone system with Kaldi? for example, "@", "{", "4" in vosk model, but doesn't in Kaldi model, Any idea to map from one to the other?

YangangCao avatar Apr 26 '24 04:04 YangangCao

That particular model uses different phoneset unfortunately. You can still map it easily, it is a simple mapping. Other models like gigaspeech use standard cmu dictionary

nshmyrev avatar Apr 26 '24 06:04 nshmyrev

Thanks for you quick and accurate reply, I know gigaspeech model right know and it is good enough to me, I don't plan to map phoneset! thanks!

YangangCao avatar Apr 26 '24 07:04 YangangCao