SenseVoice icon indicating copy to clipboard operation
SenseVoice copied to clipboard

Multilingual Voice Understanding Model

Results 140 SenseVoice issues
Sort by recently updated
recently updated
newest added

就是 识别到的每一句话 在原始音频中的时间,数据 我也想要,应该怎么设置?

question

我看很多asr模型的输入都是N x T x 80维度的,sensevoice的560维度是什么含义

question

运行demo1,输出结果:开放时间早上9点至下午5点。(vad关闭,itn开启) 运行demo2,输出结果:开饭时间早上9点至下午五点。(itn开启) demo2识别有错字且itn感觉没起作用,为什么啊,用不用automodel怎么会对识别结果产生影响呢?而且为什么demo2itn没起作用,五点没有被输出为5点?

question

效果很好,速度和准确率上表现得都非常不错,期待能有热词的更新!

question

language参数的作用是啥?指定语言吗?但是无论指定哪种语言,输入音频是yue同样也能输出粤语,这正常吗?

question

Traceback (most recent call last): File "/home/pc/.local/bin/torchrun", line 5, in from torch.distributed.run import main File "/home/pc/.local/lib/python3.10/site-packages/torch/__init__.py", line 1308, in from .serialization import save, load File "/home/pc/.local/lib/python3.10/site-packages/torch/serialization.py", line 18, in from...

question

Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节) ## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the...

question

Is the SenseVoice-Large model currently commercialized? If it is commercialized, where can I purchase the API? I would like to use the SenseVoice-Large model for audio event recognition.

question