FunASR issues

数组越界问题

7

Notice: In order to resolve issues more efficiently, please raise issue following the template. （注意：为了更加高效率解决您遇到的问题，请按照模板提问，补充细节） ## 🐛 Bug ### To Reproduce Steps to reproduce the behavior (**always include the command...

clb-123

bug

CT-Transformer标点模型的训练代码没有样例呢，求助！！！

4

## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the docs. #### What is your question? #### Code #### What have you tried? #### What's...

lzl1456

question

cpu版本的runtime，流式输入长语音后内存会增长后不释放

4

您好我在windows下更新了新的版本（1.0.7）之后，部署的cpu版本的runtime/websocket, online模式，当客户端的话筒输入一段较长时长的连续语音时（10s以上），服务端的funasr-wss-server-2pass.exe程序内存占用会缓慢增长，请问这个问题你们有遇到过吗？ #### What's your environment? - OS (e.g., win10,vs2019): - FunASR Version (e.g., 1.0.7): -

apple2333cream

question

runtime error for blank audio file

## 🐛 Bug When running the asr inference for a blank audio file, there might be a runtime error as shown in the following attached. Take the uploaded audio test...

jianganghan

bug

Vad模型处理数据会一直卡住

4

What is your question? 我用进程方式启动AutoModel处理，16K单声道的wav音频数据，vad模型内部处理数据直接卡住不动，请大佬帮我看看进程启动下vad模型内部处理数据为什么会卡住。 funasr->utils->load_utils.py的64行 data_or_path_or_list = data_or_path_or_list.mean(0) ps：用线程模式就能正常执行，但咱们线程模式长时间运行，有严重的内存泄漏，而且vad不支持多线程。 Code ..... pool = multiprocessing.Pool(2) pool.apply(func=convert, args=(wavepath)) ..... def convert(wavepath): model = AutoModel(model="iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch", model_revision="v2.0.4", vad_model="iic/speech_fsmn_vad_zh-cn-16k-common-pytorch", vad_model_revision="v2.0.4", punc_model="iic/punc_ct-transformer_zh-cn-common-vocab272727-pytorch", punc_model_revision="v2.0.4", spk_model="iic/speech_campplus_sv_zh-cn_16k-common", spk_model_revision="v2.0.2",...

CNXiDaDa

question

在和whisperx一起使用的时候出现存在版本依赖的问题

2

## 🐛 Bug 和whisperx使用的时候遇到一个版本依赖的问题，如果需要使用最新版funasr和whisperx会出现下列问题 ## 复现流程 1. 如果安转顺序是先安装whisperx，funasr，modelscope默认的whisperx版本是3.1.2，会让whisperx降级，现在whisperx最新版是3.1.3，修复了align使用的bug 2. 如果安装顺序变成先安装funasr,modelscope后安装whisperx会出现datasets版本为2.19.0，这时候下载模型会报错需要降级为2.18.0 ## 正常安装流程所以要正常安装顺序使用的话就要，先安装funasr，modelscope，whisperx，datasets==2.18.0

Honst211

bug

funasr的流式识别在英文上效果不佳

在librispeech上测试model = AutoModel(model="paraformer-zh-streaming"),效果不理想

wwfcnu

question

你好请问，这个流式asr最小支持的语音帧为多少呢，其次它的采样率有没有要求，我看默认的是16khz

现在我前端传不定长的语音帧过来请问这个流式asr还支持吗

tanggang1997

question

runtime docker部署里的所有模型版本已经写死，不支持指定版本

我线下用最新的模型训练好替换官方的模型，但docker启动后仍然下载已经写死版本的模型并覆盖了我的模型，请问下这个能支持自定义吗？在 FunASR/runtime/websocket/bin/funasr-wss-server.cpp 和 FunASR/runtime/websocket/bin/funasr-wss-server-2pass.cpp 中模型指定版本均已写死，如下所示。 `TCLAP::ValueArg model_revision( "", "model-revision", "ASR model revision", false, "v1.2.1", "string");`

juzstu

question

内存溢出

## ❓ Questions and Help 内存溢出，当部署完在线识别服务器之后，进行语音识别（启用热词的情况下），内存占用增长特快， #### What's your environment? - OS (e.g., Linux): - FunASR Version (e.g., 1.0.9): - ModelScope Version (e.g., 1.11.0): - PyTorch Version (e.g., 2.0.0): -...

huangdadahai2015

bug

FunASR
FunASR copied to clipboard

Metadata

数组越界问题

CT-Transformer标点模型的训练代码没有样例呢，求助！！！

cpu版本的runtime，流式输入长语音后内存会增长后不释放

runtime error for blank audio file

Vad模型处理数据会一直卡住

在和whisperx一起使用的时候出现存在版本依赖的问题

funasr的流式识别在英文上效果不佳

你好请问，这个流式asr最小支持的语音帧为多少呢，其次它的采样率有没有要求，我看默认的是16khz

runtime docker部署里的所有模型版本已经写死，不支持指定版本

内存溢出

← Metadata

Owner

Metadata

FunASR FunASR copied to clipboard

Metadata

← Metadata

Owner

Metadata

FunASR
FunASR copied to clipboard