FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
## ❓ Questions and Help
### Before asking:
1. Search the issues.
2. Search the docs.
#### What is your question?
#### Code
#### What have you tried?
#### What's...
## ❓ Questions and Help
**At runtime, the following error is reported: KeyError: 'FunASRPipeline: KeyError("GenericFunASR: \'config\'")'**
#### What is your question?
#### Code
`from modelscope.pipelines import pipeline`
`from modelscope.utils.constant import Tasks`
`AudioFile = 'D:/project/Paraformer/data/output.wav'`
`inference_pipeline = pipeline(...
## 🐛 Bug
Running the Python inference script inside a Docker container raises an error.
### To Reproduce
Steps to reproduce the behavior (**always include the command you ran**):
1. In the root directory, run cmd 'python main.py'
2....
```
from funasr_onnx import Paraformer, CT_Transformer
import time

asr_model_path = "./speech_paraformer-large_asr_nat-zh-cantonese-en-16k-vocab8501-online"
print("model: ", asr_model_path)
asr_model = Paraformer(asr_model_path, batch_size=1, quantize=True)
wav_path = ['./chatglm2.wav', 'chengyu.wav', 'chinese_10s_16k.wav', 'hongqiao.wav', 'Sheldon_10s.wav', 'jfk.wav']
for wavfile in wav_path:
    st...
```
Notice: In order to resolve issues more efficiently, please raise issues following the template and fill in the details.
## 🐛 Bug
1. Use Docker version: registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.4
2. Run /workspace/FunASR/runtime/run_server.sh as referred to in https://github.com/alibaba-damo-academy/FunASR/blob/main/runtime/docs/SDK_advanced_guide_offline_zh.md#python-client....
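I deployed using the Docker image funasr-runtime-sdk-cpu-0.4.4 (2dc87b86dc49), but found that speaker diarization is not yet supported. I hope runtime/run_server.sh can support the spk-model parameter soon, so that it can be combined with the speech_campplus_speaker-diarization_common model to provide speaker diarization.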
## 🐛 Bug
Running qwen-audio + vad raises an error.
### To Reproduce
1. Run cmd `python qwen_demo.py`
2. See error
```
2024-05-14 11:09:35,110 - modelscope - INFO - PyTorch version 2.3.0 Found....
```
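Notice: In order to resolve issues more efficiently, please raise issues following the template and fill in the details.
## 🐛 Bug
In the runtime environment, I am using the speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404-onnx model. When I add the hotword list below, the hotwords seem to interfere with each other. For example, with
针灸铜人 80
久通 80
testing may produce results such as 针灸通人 or 久铜. Does adding a hotword boost the probability of individual tokens? If it were whole-word matching, the WFST should not have this much effect. Is there any way to resolve this?
### Environment
- OS (e.g., Linux):...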
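
The hotword question above contrasts two ways a bias score could be applied. The toy sketch below is only an illustration of that distinction and is not FunASR's actual hotword mechanism: the functions `token_level_bonus` and `phrase_level_bonus` and their scoring rules are hypothetical. It shows why a per-character boost would reward mixed outputs such as 针灸通人 and 久铜 just as much as the intended hotwords, while a whole-phrase match would not.

```python
# Toy illustration only: both scoring schemes are hypothetical and are NOT
# taken from FunASR. The hotword list mirrors the one quoted in the issue.
HOTWORDS = {"针灸铜人": 80, "久通": 80}


def token_level_bonus(hyp, hotwords):
    """Hypothetical per-character boost: any character that occurs in any
    hotword contributes that hotword's weight, regardless of context."""
    boosted = {}
    for word, weight in hotwords.items():
        for ch in word:
            boosted[ch] = max(boosted.get(ch, 0), weight)
    return sum(boosted.get(ch, 0) for ch in hyp)


def phrase_level_bonus(hyp, hotwords):
    """Hypothetical whole-phrase boost: a hotword contributes only when the
    complete phrase appears in the hypothesis."""
    return sum(
        weight * len(word) * hyp.count(word)
        for word, weight in hotwords.items()
    )


if __name__ == "__main__":
    # Compare the intended hotwords with the mixed outputs from the issue.
    for hyp in ["针灸铜人", "针灸通人", "久通", "久铜"]:
        print(hyp, token_level_bonus(hyp, HOTWORDS), phrase_level_bonus(hyp, HOTWORDS))
```

Under the per-character scheme the mixed hypotheses receive the same bonus as the real hotwords, because 通 and 铜 are each boosted by one of the two entries. Under the whole-phrase scheme they receive none. Which of these (if either) matches the runtime's behavior is the open question in the issue.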