FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
## ❓ Questions and Help In speech recognition we often find that the last character of an utterance is dropped. Here is a test demo, please take a look. Environment: Linux (Ubuntu). Command run in the terminal: `funasr ++model="paraformer-zh" ++input=aaaa.wav` The result returned is: [{'key': 'rand_key_2yW4Acq9GFz6Y', 'text': '我 要 打', 'timestamp': [[1830, 2050], [2050, 2270], [2270, 3175]]}] The expected text is 我要打卡, i.e. the final character "卡" is missing. The test audio is attached; unzip it before use...
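For reference, a minimal sketch of the same call through the Python API, assuming the attached clip is named aaaa.wav; adding fsmn-vad as a front end is only a guess at a workaround for the clipped tail, not a confirmed fix:

```python
from funasr import AutoModel

# Reproduce the CLI call above via the Python API.
# vad_model="fsmn-vad" is an assumption: a VAD front end may change how the
# tail of the utterance is segmented, which is what this issue is about.
model = AutoModel(model="paraformer-zh", vad_model="fsmn-vad")

res = model.generate(input="aaaa.wav")  # the attached test clip from the issue
print(res[0]["text"])
```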
As shown in the figure below, I use speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch for speech recognition together with speech_eres2net_sv_zh-cn_16k-common for speaker recognition, but loading speech_eres2net_sv_zh-cn_16k-common raises an error. Is it not supported yet? Could support be added?
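A sketch of how a speaker model is usually plugged into AutoModel, assuming the eres2net model ID can be passed the same way the cam++ model is in the FunASR examples; whether that ID is actually accepted is exactly what this issue is asking:

```python
from funasr import AutoModel

# Sketch only: cam++ is the speaker model shown in the FunASR examples.
# Substituting the eres2net ID below is the configuration this issue reports
# as failing to load.
model = AutoModel(
    model="iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch",
    vad_model="fsmn-vad",
    punc_model="ct-punc",
    spk_model="iic/speech_eres2net_sv_zh-cn_16k-common",  # the model that fails to load
)
res = model.generate(input="meeting.wav")  # hypothetical input file
print(res)
```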
## ❓ Questions and Help After starting the funASR service with the container image, calling it from the client fails with "vad_handle is null". The server startup log is as follows: root@ea758f46b6b5:/workspace/FunASR/runtime# bash run_server_2pass.sh \ > --download-model-dir...
#### Loading the SenseVoiceSmall model with funasr reports "not registered" #### Code ``` from funasr import AutoModel model = AutoModel(model=p.model) ``` p.model is a parameter I pass in: the path to the SenseVoiceSmall model I have already downloaded. It then reports: AssertionError: C:\REAPER\UserPlugins\funasr\SenseVoiceSmall is not registered - OS: Win10 - FunASR Version: 1.1.4
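A sketch of loading SenseVoiceSmall from a local directory, based on the SenseVoice usage examples; trust_remote_code and the remote_code path are assumptions, not a confirmed fix for the "not registered" error:

```python
from funasr import AutoModel

# Sketch under assumptions: the SenseVoice examples pass trust_remote_code
# and a remote_code path pointing at the model's own model.py when loading
# from a local directory.
model_dir = r"C:\REAPER\UserPlugins\funasr\SenseVoiceSmall"  # local download from the issue
model = AutoModel(
    model=model_dir,
    trust_remote_code=True,
    remote_code=model_dir + r"\model.py",  # assumed location of the model code
)
res = model.generate(input="test.wav", language="zh", use_itn=False)
print(res[0]["text"])
```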
## ❓ Questions and Help How do I configure the speaker recognition module for the Chinese offline transcription service (CPU)? I currently don't see a speaker recognition model option in the configuration. #### Code nohup bash run_server.sh \ --download-model-dir /workspace/models...
Hoping the following issues with the funasr 4.5-cpu Docker image can be addressed: 1. Concurrency is not supported: as long as the service has an unfinished task, sending more data makes the backend service crash and restart. 2. When an invalid file is sent and processing fails, the service should return an error message to the client so the client can handle it.
The documentation describes the models like this: # damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-onnx (timestamps) # damo/speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404-onnx (nn hotwords) If I need both timestamps and hotwords, what should I do? Is that impossible? My understanding is that I can only pick one of the two. Also, what does "nn hotwords" mean?
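For context, a sketch of how hotwords are passed in the Python API with the contextual (SeACo) Paraformer; whether a single model can return both timestamps and hotword biasing, especially in the ONNX runtime, is the open question in this issue:

```python
from funasr import AutoModel

# Sketch only: the FunASR examples pass a hotword string to generate() for
# the contextual/SeACo Paraformer; the hotword below is illustrative.
model = AutoModel(
    model="iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch",
)
res = model.generate(input="test.wav", hotword="魔搭")
print(res[0]["text"])
```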
## 🐛 Bug Testing with the sample code raises TypeError: expected Tensor as element 1 in argument 0, but got str...
```python
from funasr import AutoModel
import logging

logging.getLogger().setLevel(logging.INFO)

ser_kwargs = {"use_itn": False, "language": "zh", "batch_size": 64, "model": "iic/SenseVoiceSmall"}
funasr_model = AutoModel(
    model="paraformer-zh",
    vad_model="fsmn-vad",
    punc_model="ct-punc",
    spk_model="cam++",
    ser_model="SenseVoiceSmall",
    ser_kwargs=ser_kwargs,
    disable_update=True,
)
input_audio = "/Users/whs/PycharmProjects/speakerLog/test.wav"
device = "cpu"
res = funasr_model.inference_with_vad(input_audio)
```
...
At the same time, I run VAD inference on its own, reading data from the microphone with sounddevice, using the following:

```python
sample_rate = 16000  # sampling rate
channels = 1         # mono
dtype = "int16"      # data type
blocksize = 1024     # block size

def record_audio():
    with sd.InputStream(
        samplerate=sample_rate,
        channels=channels,
        dtype=dtype,
        blocksize=blocksize,
```
...
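For comparison, a streaming fsmn-vad sketch in the style of the FunASR examples, which feeds fixed-size chunks from a file rather than from sounddevice; wiring in the microphone blocks above would at minimum require converting the int16 samples to float (an assumption), and the example file name is hypothetical:

```python
from funasr import AutoModel
import soundfile

chunk_size = 200  # chunk length in ms
model = AutoModel(model="fsmn-vad")

# Hypothetical file; the issue uses live microphone input instead.
speech, sample_rate = soundfile.read("vad_example.wav")
chunk_stride = int(chunk_size * sample_rate / 1000)

cache = {}
total_chunk_num = int(len(speech) - 1) // chunk_stride + 1
for i in range(total_chunk_num):
    speech_chunk = speech[i * chunk_stride:(i + 1) * chunk_stride]
    is_final = i == total_chunk_num - 1
    res = model.generate(input=speech_chunk, cache=cache, is_final=is_final, chunk_size=chunk_size)
    if len(res[0]["value"]):
        print(res)  # emitted segment boundaries, in ms
```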