加入语言模型解码会掉字
我在识别的时候加入语言模型的tlg来解码,会出现很多掉字的现象,wer比没加前要大很多
调下acoustic scale
On Tue, Jun 3, 2025, 16:26 wwfcnu @.***> wrote:
wwfcnu created an issue (wenet-e2e/wenet#2735) https://github.com/wenet-e2e/wenet/issues/2735
我在识别的时候加入语言模型的tlg来解码,会出现很多掉字的现象,wer比没加前要大很多
— Reply to this email directly, view it on GitHub https://github.com/wenet-e2e/wenet/issues/2735, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABFN3Q653FUV3DWKMP7QDVT3BVL4XAVCNFSM6AAAAAB6PBF5NOVHI2DSMVQWIX3LMV43ASLTON2WKOZTGEYTEOJXGEYTMMY . You are receiving this because you are subscribed to this thread.Message ID: @.***>
调下acoustic scale …
调节acoustic scale和length_penalty都有改善,这里优先调节哪一个合适呢
调节acoustic scale和length_penalty之后,解码有时候会出现重复的字
调节length_penalty=-5.0,掉字也会减少