FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
1. Modify the model-loading code to adapt to the new model file names.
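Does the community software package not yet support speaker diarization in combination with the speaker model iic/speech_campplus_sv_zh-cn_16k-common? I added the spk_model parameter and got an error: PARSE ERROR: Argument: --spk_model Couldn't find match for argument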
## ❓ Questions and Help #### What is your question? While using the toolkit, the following error suddenly appeared: Sizes of tensors must match except in dimension 1. Expected size 2 but got size 1 for tensor number...
Notice: In order to resolve issues more efficiently, please raise issues following the template and include the relevant details. ## 🐛 Bug I generated spk (speaker labels) offline from a single audio file, but the contents of key and sentence_info do not match: sentence_info contains only half of the content of key. ### To Reproduce Steps to reproduce the behavior (**always include the...
#### Question Exporting the model by following https://github.com/alibaba-damo-academy/FunASR/tree/main/runtime/python/libtorch fails: python3.8 -m funasr.export.export_model --model-name damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch --export-dir ./export --type onnx --quantize True export model: damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch 2024-05-09 19:49:59,210 - modelscope - INFO - PyTorch version 2.1.2 Found. 2024-05-09 19:49:59,210 -...
## 🐛 Bug When I test audio ASR, memory usage keeps going up, no...
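I see that 3d-speaker already supports C++. How can FunASR be made to use a speaker diarization model? I could not find a parameter for passing such a model in. Alternatively, how can it be used together with this model: [iic/speech_campplus_speaker-diarization_common](https://www.modelscope.cn/models/iic/speech_campplus_speaker-diarization_common/files)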
I start AutoModel in a separate process to handle 16 kHz mono WAV audio, and the VAD model hangs while processing the data internally. Could someone help me understand why the VAD model's internal data processing hangs when it is started in a process? It stops at line 64 of funasr->utils->load_utils.py: data_or_path_or_list = data_or_path_or_list.mean(0) PS: with threads it runs normally, but long-running thread mode has a severe memory leak, and VAD does not support multithreading. Code ..... pool = multiprocessing.Pool(2) pool.apply(func=convert, args=(wavepath)) ..... def convert(wavepath): model = AutoModel(model="iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch", model_revision="v2.0.4", vad_model="iic/speech_fsmn_vad_zh-cn-16k-common-pytorch", vad_model_revision="v2.0.4", punc_model="iic/punc_ct-transformer_zh-cn-common-vocab272727-pytorch", punc_model_revision="v2.0.4", spk_model="iic/speech_campplus_sv_zh-cn_16k-common", spk_model_revision="v2.0.2", ncpu=1, device="cpu", disable_pbar=True, disable_log=False...
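A side note on the snippet in this report: `args=(wavepath)` is a plain string, not a one-element tuple, so `apply` would unpack the path character by character. The sketch below illustrates only that Python detail and is not a diagnosis of the reported hang. It assumes a stand-in `convert` that echoes its argument instead of loading any model, a hypothetical helper `run`, a placeholder path `a.wav`, and `ThreadPool` in place of `multiprocessing.Pool` so that it runs anywhere.

```python
from multiprocessing.pool import ThreadPool


def convert(wavepath):
    # Stand-in for the real worker: no model is loaded here,
    # the function only echoes the path it received.
    return f"processed {wavepath}"


def run(args):
    # Hypothetical helper: apply() calls convert(*args), so args must be
    # a sequence holding exactly one element, the path.
    with ThreadPool(1) as pool:
        return pool.apply(func=convert, args=args)


wavepath = "a.wav"  # placeholder path, not taken from the report

# One-element tuple (note the trailing comma): convert("a.wav") is called.
ok = run((wavepath,))

# Parentheses alone do not make a tuple: the string is unpacked into
# five separate characters and the call fails with a TypeError.
try:
    run((wavepath))
    bad = None
except TypeError as exc:
    bad = type(exc).__name__

print(ok, bad)
```

If the original code is written exactly as quoted, passing `args=(wavepath,)` would be the first thing to rule out before looking at the VAD model itself.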
## 🐛 Bug Offline results are missing the first character. ### To Reproduce Steps to reproduce the behavior...
## 🐛 Bug In some cases post-processing raises an IndexError exception. ### To Reproduce Steps to reproduce the behavior (**always include the...