FunASR
FunASR copied to clipboard
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
FunASR离线文件转写服务GPU版本可以支持ARM平台跟国产化NPU卡吗?例如华为910B
## 🐛 Bug Errors may occur when calculating timestamps:“IndexError: list index out of range” #### Code sample imput is BytesIO ``` model = AutoModel( model=model, vad_model=vad_model, spk_model=spk_model, spk_mode="vad_segment", device=device, disable_update=True...
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## 🐛 Bug [音频地址](https://media.xyzcdn.net/6566a3072f622ec0e5f43aeb/lqgXBEJrEQNv_FPoSRNRpB0RDCVc.m4a),使用如下模型代码识别 ``` AutoModel( model="paraformer-zh", model_revision="v2.0.4", vad_model="fsmn-vad", vad_model_revision="v2.0.4", punc_model="ct-punc-c", punc_model_revision="v2.0.4", device=self.device, disable_pbar=True, disable_update=True,...
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the...
[W shape_analysis.cpp:841] failed PropagateTensorShapeOnNode with schema: aten::pad(Tensor self, int[] pad, str mode="constant", float? value=None) -> (Tensor) %input.987 : Float(*, *, *, requires_grad=0, device=cuda:0) = aten::pad(%input.985, %5760, %5761, %5762), scope: __module.model.encoders/__module.model.encoders.48/__module.model.encoders.48.self_attn/__module.model.encoders.48.self_attn.pad_fn...
Got Error when loading funasr ```python Traceback (most recent call last): File "/home/yihua.zhou/asr-Paraformer/Paraformer-predict.py", line 28, in from modelscope.pipelines import pipeline File "/opt/conda/lib/python3.11/site-packages/modelscope/pipelines/__init__.py", line 4, in from .base import Pipeline File...
我看项目的model里面只有sensevoice,里面结构对应的是SenseVoiceSmall的结构,funasr是不是不能微调SenseVoiceLarge?
1.服务器无网络 2.模型提前下载到了服务器中并挂载到了容器内部 3.wss脚本可以正常拉起,一直请求下载模型,最终可以启动成功也可以调用,但是过一段时间后容器内wss会自动重启. 4.启动参数: cd FunASR/runtime && nohup bash run_server_2pass.sh --certfile 0 --download-model-dir /workspace/models --vad-dir damo/speech_fsmn_vad_zh-cn-16k-common-onnx --model-dir damo/SenseVoiceSmall-onnx damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-onnx --online-model-dir damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-online-onnx --punc-dir damo/punc_ct-transformer_zh-cn-common-vad_realtime-vocab272727-onnx --lm-dir damo/speech_ngram_lm_zh-cn-ai-wesp-fst --itn-dir thuduj12/fst_itn_zh --hotword /workspace/models/hotwords.txt >...
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the...
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## ❓ 我想批量识别视频句子的情绪, 每个句子都很短, 但是连续运行显存会越来越大, 显存变化如下: 2.2 2.2 2.6 2.2 2.3 2.4 2.7 2.4 2.5...