SapphireLab comments

Results 75 comments of


                                            SapphireLab

trafficstars

> Faster Whisper ASR large模型，生成的list文件里面出现大量的重复字段，例如： `G:\Datasets\GuYunAGI\VerinAudio\slice\12.flac_0011459520_0011566720.wav|slice|EN|And the time the outside world is, and the time the outside world is, and the time the outside world is, and the time the...

修改 ASR 工具

> > > Faster Whisper ASR large模型，生成的list文件里面出现大量的重复字段，例如： `G:\Datasets\GuYunAGI\VerinAudio\slice\12.flac_0011459520_0011566720.wav|slice|EN|And the time the outside world is, and the time the outside world is, and the time the outside world is, and the...

修改 ASR 工具

> 找到原因了，是模型幻觉问题。如果whisper遇到长时间的沉默就会不断重复之前的短语或短句。 `segments, info = model.transcribe( audio=file, beam_size=5, vad_filter=True, vad_parameters=dict(min_silence_duration_ms=700), condition_on_previous_text=False, suppress_tokens=[], language=language)` 这一部分中我添加了两个参数 condition_on_previous_text=False, suppress_tokens=[]试图抑制幻觉，可以在Webui中添加一个抑制幻觉选项。长时间沉默是指语音中静音段较长？再进行适当切分应该不会出现此问题？

修改 ASR 工具

> > > 找到原因了，是模型幻觉问题。如果whisper遇到长时间的沉默就会不断重复之前的短语或短句。 `segments, info = model.transcribe( audio=file, beam_size=5, vad_filter=True, vad_parameters=dict(min_silence_duration_ms=700), condition_on_previous_text=False, suppress_tokens=[], language=language)` 这一部分中我添加了两个参数 condition_on_previous_text=False, suppress_tokens=[]试图抑制幻觉，可以在Webui中添加一个抑制幻觉选项。 > > > > > > 长时间沉默是指语音中静音段较长？再进行适当切分应该不会出现此问题？ > > 我切分后的语音有些不到三秒，但依旧有此问题。顺带一提， `segments,...

SapphireLab

修改 ASR 工具

修改 ASR 工具

修改 ASR 工具

修改 ASR 工具

修改 ASR 工具

修改 ASR 工具

Windows 整合包如何更新？

Windows系统模型训练环节报错

还是go-webui.bat无法运行的问题

训练GPT-SoVITS报错