SenseVoice issues

How can I export the results as a subtitle file?

2

I also want the timestamp of each recognized sentence in the original audio. How should I configure this?

webkonglong

question
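For the subtitle-export question above, here is a minimal sketch of turning timestamped sentences into SRT text. It assumes the per-sentence start and end times (in milliseconds) are already available from some upstream step; the `segments` list and both function names are hypothetical and not part of SenseVoice.

```python
def format_srt_time(ms: int) -> str:
    """Format a millisecond offset as an SRT timestamp (HH:MM:SS,mmm)."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def segments_to_srt(segments):
    """Build SRT text from (start_ms, end_ms, text) tuples.

    The tuple layout is an assumption made for this sketch.
    """
    blocks = []
    for index, (start_ms, end_ms, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_srt_time(start_ms)} --> {format_srt_time(end_ms)}\n"
            f"{text}\n"
        )
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)


# Hypothetical segments, for illustration only.
segments = [(0, 1500, "first sentence"), (1500, 4200, "second sentence")]
print(segments_to_srt(segments))
```

Writing the returned string to a `.srt` file in UTF-8 is enough for most players to load it.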

Why is the SenseVoice model's input N x T x 560?

1

Many ASR models I have seen take N x T x 80 input. What does SenseVoice's 560 dimension mean?

dlkht

question
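One plausible reading of the 560 in the question above is frame stacking (often called low frame rate, LFR): several consecutive 80-dimensional filterbank frames are concatenated into one wider frame, and 7 x 80 = 560. The sketch below only illustrates that idea. The window of 7 frames and stride of 6 are assumptions here, and the padding scheme is simplified, so it should not be taken as the library's exact front end.

```python
def stack_frames(frames, m=7, n=6):
    """Concatenate m consecutive frames, advancing n frames per step.

    frames: list of equal-length feature vectors (T x D).
    Returns a list of vectors of length m * D.
    m and n are assumed values chosen so that 7 * 80 = 560.
    """
    if not frames:
        return []
    stacked = []
    for start in range(0, len(frames), n):
        window = frames[start:start + m]
        # Simplified padding: repeat the last frame when the window runs short.
        window = window + [frames[-1]] * (m - len(window))
        stacked.append([value for frame in window for value in frame])
    return stacked


# 100 dummy frames of 80 features each, standing in for filterbank output.
features = [[0.0] * 80 for _ in range(100)]
stacked = stack_frames(features)
print(len(stacked), len(stacked[0]))
```

Under this reading, T in N x T x 560 counts stacked frames, so it is roughly one sixth of the original frame count.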

Why do demo1 and demo2 give different results on the Chinese example audio?

2

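Running demo1 outputs: 开放时间早上9点至下午5点。 (VAD off, ITN on). Running demo2 outputs: 开饭时间早上9点至下午五点。 (ITN on). demo2 gets a character wrong (开饭 instead of 开放), and ITN does not seem to take effect. Why would using or not using AutoModel affect the recognition result? And why did ITN not take effect in demo2, leaving 五点 instead of outputting 5点?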

Hou-MY

question

Looking forward to a hotword update

1

The results are great, and both speed and accuracy are very good. Looking forward to a hotword update!

liduowen

question

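What does the language parameter do? Does it specify the language? Whichever language I specify, Cantonese (yue) input audio still produces Cantonese output. Is this expected?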

Jimmy-L99

question

Calling the model's HTTP endpoint /api/v1/asr from Java with the Hutool utility library returns an internal error. Is there a way to fix this?

niedamin

question

ModuleNotFoundError: No module named 'typing_extensions'

1

Traceback (most recent call last):
  File "/home/pc/.local/bin/torchrun", line 5, in <module>
    from torch.distributed.run import main
  File "/home/pc/.local/lib/python3.10/site-packages/torch/__init__.py", line 1308, in <module>
    from .serialization import save, load
  File "/home/pc/.local/lib/python3.10/site-packages/torch/serialization.py", line 18, in <module>
    from...

daixun0913

question

Running demo1 prints some "not registered" error messages. Why is that?

7

Notice: In order to resolve issues more efficiently, please raise the issue following the template and fill in the details. ## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the...

hnxueque

question

Is the SenseVoice-Large model currently commercialized?

Is the SenseVoice-Large model currently commercialized? If it is commercialized, where can I purchase the API? I would like to use the SenseVoice-Large model for audio event recognition.

JinYuanZhang999

question

Can it run on CPU? I am on a Mac.