FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Results: 555 FunASR issues

Notice: To help us resolve issues more efficiently, please follow the template when raising an issue and include the relevant details. ## ❓ Questions and Help [W shape_analysis.cpp:841] failed PropagateTensorShapeOnNode with schema: aten::pad(Tensor self, int[] pad,...

question

Start the container: docker run -p 10096:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.12 Run the script: nohup bash run_server_2pass.sh \ --download-model-dir /workspace/models \ --vad-dir damo/speech_fsmn_vad_zh-cn-16k-common-onnx \ --model-dir iic/SenseVoiceSmall-onnx \ --online-model-dir damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-online-onnx \ --punc-dir...

question

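Modify the FunASR real-time recognition framework so that, in 2pass mode, real-time recognition returns sentence-level timestamps at the framework level, in milliseconds.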

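The demo in the paraformer_streaming file runs streaming inference on a single wav file. I now have a batch of wav files and need to run batch inference instead of looping over the files and inferring them one at a time. How can this be done?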

question

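![image](https://github.com/user-attachments/assets/e34ae8fb-0693-4989-af0c-7ae4e1eecec9) I have already downloaded the model automatically once via the hf parameter, but every time I run the code below it still goes online and downloads the model again. Can I point directly to a local file path without triggering a download? Passing the parameter like this raises an error every time! ```python self.model = AutoModel( model = "/home/111/.cache/huggingface/hub/models--FunAudioLLM--SenseVoiceSmall", # vad_model = vad_model, # vad_kwargs=vad_kwargs, device = device, disable_update = disable_update ) ``` ```cmd [111@zernithos]$...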

question

## 🐛 Bug While calling SANMEncoder on its own for debugging, I found that the `sanm_shift` parameter listed for `SANMEncoder` in the paraformer-zh config file does not exist in the SANMEncoder code implementation, and its meaning is unclear. Debugging with the following approach raises an error. ### To Reproduce ```python from funasr.register import tables kwargs = {...

bug

## ❓ Questions and Help ### Question 1 Is there a conf for llm-asr-nar? I suspect the config file I wrote myself is not quite right: ``` model: LLMASRNAR model_conf: lsm_weight: 0.1 # label smoothing option length_normalized_loss: true encoder: SANMEncoder encoder_conf: hub: funasr init_param_path: "/ssd/zhuang/code/LLM/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch/" freeze: true llm: Qwen2.5-1.5B-Instruct...

question

Does funasr fsmn_ctc kws not support conversion to ONNX? Are there plans to implement ONNX export?

question

## ❓ Questions and Help How do I set an English wake word when running inference? The script/code cannot handle wake words that contain spaces, such as "hey siri".

question

Using cam++ as the spk_model raises "string index out of range"; removing it makes the problem go away. Source code attached: ``` from funasr import AutoModel # paraformer-zh is a multi-functional asr model # use vad, punc, spk or not as you need path =...

question