SenseVoice issues

why is it so slow to prepare train.json

To check that the finetune process is ok , i use 30w sentences for training I found it is very slow to prepare train.json. 8 hours passed, the train.json of...

housebaby

question

微调训练模型，训练loss收敛，验证loss不收敛

4

![lALPM3V2q2_DQgLNA6fNAs8_719_935](https://github.com/user-attachments/assets/2fab2a5a-e6be-4f7a-9837-1b9d87fd2c7b)

wuhongsheng

question

Notice: In order to resolve issues more efficiently, please raise issue following the template. （注意：为了更加高效率解决您遇到的问题，请按照模板提问，补充细节） ## ❓ Questions and Help 设置的batch_size为128，但是得到的batchsize为1，导致微调失败 ### Before asking: 1. search the issues. 2. search...

yangppde

question

sensevoice 微调加载数据出错

6

Notice: In order to resolve issues more efficiently, please raise issue following the template. （注意：为了更加高效率解决您遇到的问题，请按照模板提问，补充细节） ## ❓ Questions and Help 数据安装文档使用sensevoice2jsonl 生成的数据，微调时加载数据出现 data is empty! ### Before asking: 1. search...

yangppde

question

运行api报错

python api.py 或是用fastapi启动时 ![b9c037ea3681c7eef09f0403f11c895](https://github.com/user-attachments/assets/78b5770c-d718-44ff-b2e1-237b0bee09d8) ![255d04b83bdc692b71b409e740cde28](https://github.com/user-attachments/assets/7154eb9e-bcd2-446c-a5bd-383790f83aad)

EINEZIO

bug

SenseVoiceSmall微调是否支撑增加事件/情绪/语言类型

6

SenseVoiceSmall微调是否支撑增加事件/情绪/语言类型？经查阅源代码后发现funast/models/sensevoice/model.py中line 640-647中给出了情绪、语言的编码字典，但并没有事件相关的，想请问可以通过微调增加模型能检测的事件/情绪/语言吗？

Danyuhui

question

无法识别wav文件

Notice: In order to resolve issues more efficiently, please raise issue following the template. （注意：为了更加高效率解决您遇到的问题，请按照模板提问，补充细节） ## 🐛 Bug 可以正常识别mp3格式文件，但是不能识别wav文件，会卡住不动,直到你ctrl+c ### To Reproduce Steps to reproduce the behavior (**always include the...

leslie2046

bug

修改哪些位置可以使其在torch2.4.0环境下正常运行

## ❓ Questions and Help #### What is your question? 如何在torch 2.4.0环境下使用sensevoice 我正在做一个纯本地运行的语音对话的项目，使用SenseVoice+LLM+GPT-SoVITS来达到语音对语音交流的效果，其中部分模块必须要使用torch2.4.0，所以求大佬帮忙看下SenseVoice这个项目怎么样才能在torch2.4.0环境下正常运行异常见下图： ![image](https://github.com/user-attachments/assets/ba9ab113-6526-4b52-9cd9-4ae9d9c546a9) ①运行在纯净的虚拟环境下，仅将torch升级为2.4.0，python版本3.11.2； ②torchaudio的版本与torch同步； ③尝试过修改`weights_only=True`，除了少了几行提示其他没有区别） #### Code ```python from funasr import AutoModel from funasr.utils.postprocess_utils import rich_transcription_postprocess import...

LuWu9

question

What languages does SenseVoice recognize?

How is it with recognizing other languages? From the description it appears that the model was trained on 50 languages. How can you use languages other than the standard ones?...

brainhome

question

Question related rtf_avg, time_speech, time_escape ?

## ❓ Questions below is the generated output from the model, is there any chance I can ignore these rtf_avg,time_speech,time_escape from the model, as I only want to see the...

gmumar788

question

SenseVoice
SenseVoice copied to clipboard

Metadata

why is it so slow to prepare train.json

微调训练模型，训练loss收敛，验证loss不收敛

微调时batch_size设置问题

sensevoice 微调加载数据出错

运行api报错

SenseVoiceSmall微调是否支撑增加事件/情绪/语言类型

无法识别wav文件

修改哪些位置可以使其在torch2.4.0环境下正常运行

What languages does SenseVoice recognize?

Question related rtf_avg, time_speech, time_escape ?

← Metadata

Owner

Metadata

SenseVoice SenseVoice copied to clipboard

Metadata

← Metadata

Owner

Metadata

SenseVoice
SenseVoice copied to clipboard