VoxCPM issues

声音错乱以及结束符判定问题

1

目前部署的nano-vllm的版本，感觉使用下来有几个问题 1. 因为推理是传入参考文本和目标文本一起的，所以参考文本的结束符号会影响目标文本，使用下来参考文本的英文标点符号不会判定为文本结束，导致参考文本的结尾几个字 2. 克隆出来的语音和参考音色不一致，甚至性别会变

Monica8023

app.py

7

app.py start fail.

Storyinsea

能否提供从头训练的参数配置

1

非常优秀的工作，请问能否提供从头训练的参数配置

shangqwe123

1.5版本一推理就直接退出，没有任何报错，之前的都可以，用的uv run app.py

1

guodayang49

doc: A lack of documentation makes it hard to quickly get started.

3

This project's documentation is completely wretched (or trash). On macOS, app.py won't even run! The most insane part is that for such a great/powerful project, they don't even bother including...

premade-man

微调数据集收集建议

2

官方给出了微调相关的代码，已经是一个巨大的进步。不过数据集的维护也是生态建设的一部分，因此呼吁看到这个issue的朋友们，可以把数据集共享一下。当然我本人后续也会整理一些自己用到的数据集

rainfallclub

英文朗读的问题

2

### 问题： 1. 为什么中英文混读的时候，英文的劣化这么严重？经常连基本的单词都读不明白。这是这版本目前测试下来最大的问题。纯英文场景我没测试过，但是在中文中穿插着单词，哪怕这是个常见的标准单词，依然有很高的概率出现读法完全错误。这是提示语音的问题还是模型的问题？ **说明:** 我的提示语音使用sensevoice small识别出来的文本，语音朗读是很标准的普通话。

maxin9966

有很大概率出现尾部超长静音

2

prompt_wav,prompt_text没有错误，使用nanovllm_voxcpm库进行推理，模型为1.5B版本。看显卡占用，像是推到了max_generate_length才停下来

JokerJostar

Ability to Be Ran From Background Thread Broken With 1.5

2

We're currently seeing our TTS returning back errors pertaining to `torch compile` and how it's causing the apps not to be able to run with background threads using VoxCPM 1.5....

vltmedia

Info on finetune scripts

14

Thank you for updating finetune support. before diving in to finetuning , could you please share few details that can help the journey 1. can we finetune on any new...

gkiri

VoxCPM
VoxCPM copied to clipboard

Metadata

声音错乱以及结束符判定问题

app.py

能否提供从头训练的参数配置

1.5版本一推理就直接退出，没有任何报错，之前的都可以，用的uv run app.py

doc: A lack of documentation makes it hard to quickly get started.

微调数据集收集建议

英文朗读的问题

有很大概率出现尾部超长静音

Ability to Be Ran From Background Thread Broken With 1.5

Info on finetune scripts

← Metadata

Owner

Metadata

VoxCPM VoxCPM copied to clipboard

Metadata

← Metadata

Owner

Metadata

VoxCPM
VoxCPM copied to clipboard