FunASR
FunASR copied to clipboard
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
在工程中已安装好环境,并且可以运行(runtime\python\http\server.py) 使用Pyinstaller打包 pyinstaller --hidden-import=modelscope --onefile .\server.py 打包成功,但运行出错,看上去运行了模型下载,但模型初始化似乎无效,报错结果如下:  环境及版本说明: Package Version Editable project location ------------------------- ----------- ------------------------- addict 2.4.0 aiofiles 23.2.1 aiohttp 3.9.5 aiosignal 1.3.1 aliyun-python-sdk-core 2.15.1 aliyun-python-sdk-kms 2.16.3 altgraph...
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the...
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## 🐛 Bug [rank2]: OSError: [Errno 12] Cannot allocate memory ### To Reproduce examples/industrial_data_pretraining/paraformer/finetune.sh ###...
报错内容: Error executing job with overrides: ['++model=iic/speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404', '++train_data_set_list=/home/Group/jiangyf/voice_asr/FunASR/data/Specialty_stores/train.jsonl', '++valid_data_set_list=/home/Group/jiangyf/voice_asr/FunASR/data/Specialty_stores/val.jsonl', '++dataset_conf.batch_size=20000', '++dataset_conf.batch_type=token', '++dataset_conf.num_workers=4', '++train_conf.max_epoch=30', '++train_conf.log_interval=1', '++train_conf.resume=false', '++train_conf.validate_interval=2000', '++train_conf.save_checkpoint_interval=2000', '++train_conf.keep_nbest_models=20', '++train_conf.avg_nbest_model=10', '++optim_conf.lr=0.0002', '++output_dir=./outputs_Specialty_stores_v1'] Traceback (most recent call last): File "../../../funasr/bin/train.py", line 225,...
使用了GPU运行,但是CPU使用率依旧很高?根据性能报告,GPU确实生效。请问这个现象的原因是为什么? 我的代码如下: ```from funasr import AutoModel import time import json # paraformer-zh is a multi-functional asr model # use vad, punc, spk or not as you need model = AutoModel(...
我们想使用 FunASR 做呼叫中心的 录音质检 发现同一段录音,使用: funasr ++model=paraformer-zh ++vad_model="fsmn-vad" ++punc_model="ct-punc" ++input=测试录音文件.wav ---》 正确率还不错,基本都正确 但是,使用:中文离线文件转写服务(CPU版本)版本的 python3 funasr_wss_client.py --host "127.0.0.1" --port 10095 --mode offline --audio_in "测试录音文件.wav" ---》 测试效果非常差,完全不正确 请问,这 2 者出现差异,是因为我的配置问题,还是什么问题,如何解决,非常感谢
python版本的输出 ``` from funasr import AutoModel model = AutoModel(model="iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch", model_revision="v2.0.4", # vad_model="damo/speech_fsmn_vad_zh-cn-16k-common-pytorch", # vad_model_revision="v2.0.4", # punc_model="damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch", # punc_model_revision="v2.0.4", ) wav_path = 'https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/asr_example_zh.wav' res = model.generate(input=wav_path, hotword='达摩院', batch_size_s=300) print(res) rtf_avg: 0.176:...
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the...
您好,我已经通过llm-asr里的脚本,在aishell数据集上进行模型微调,但是模型训练出来后,发现无法利用demo_infer.sh脚本进行模型推理,报错内容如下: hydra error COnfigCompositionException: Ambigouous value for argument '++scope_map=encoder.model, audio_encoder,encoder_projector,adaptor' - OS (e.g., Linux): - FunASR Version 1.0.25 : - ModelScope Version : - PyTorch Version 2.0.1: - How you...