miaohuil

Results 6 issues of miaohuil

我现在用的是ffmpeg6.1.1。 我的config.yaml配置如下: name: 百聆(bailing) version: 1.0 logging: level: debug interrupt: false # 具体处理时选择的模块 selected_module: Recorder: RecorderPyAudio ASR: FunASR VAD: SileroVAD LLM: OpenAILLM TTS: MacTTS Player: CmdPlayer Recorder: RecorderPyAudio: output_file: tmp/...

我是win10系统,缺省没有play命令。 采用了PyaudioPlayer,但报错误3,因该是ChatTTS生成的wav格式不兼容。 将tts.py里面对应部分进行修改: try: # torchaudio.save(tmpfile, torch.from_numpy(wavs[0]).unsqueeze(0), 24000) torchaudio.save(tmpfile, torch.from_numpy(wavs[0]), 24000, encoding="PCM_S", bits_per_sample=16) except: # torchaudio.save(tmpfile, torch.from_numpy(wavs[0]), 24000) torchaudio.save(tmpfile, torch.from_numpy(wavs[0]), 24000, encoding="PCM_S", bits_per_sample=16) 播放兼容问题得以解决。

模型找到了需要调用的function,但无法完成后续操作,日志如下: 2024-10-07 22:54:33,963 - bailing.robot - INFO - Started recording. 2024-10-07 22:54:38,070 - bailing.vad - DEBUG - VAD output: {'start': 58912} 2024-10-07 22:54:39,800 - bailing.vad - DEBUG - VAD output:...

tts生成的开头两个音频播放时会被漏掉

原来的配置: ASR: FunASR: model_dir: models/SenseVoiceSmall output_file: tmp/ 需要修改为: ASR: FunASR: model_dir: FunAudioLLM/SenseVoiceSmall output_dir: tmp/