MaxMax2016 comments

Results 243 comments of


                                            MaxMax2016

incorrect audio shape

4 cut audio, less than 30 seconds for whisper

ValueError: not enough values to unpack (expected 4, got 1)

有更详细的错误信息没有？

ValueError: not enough values to unpack (expected 4, got 1)

训练文件train.txt里面多了空白行？

I have a question about using lora for fine-tuning

there has no vits, just a bigvgan, after upsample layers use speaker info to change x with weights and biases.

I have a question about using lora for fine-tuning

lora is low rank adapter, here is another adapter from micsoft [AdaSpeech: Adaptive Text to Speech for Custom Voice](https://arxiv.org/pdf/2103.00993.pdf) or [Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers](https://arxiv.org/abs/2211.00585)

I have a question about using lora for fine-tuning

lora_svc is not real lora, use this name is just want svc developers to think about lora.

音色是否可以融合或者调整

音色A * x +音色B * (1-x) ->新的音色， 0

训练一切正常，但是保存第一个模型时出现了ZeroDivisionError: float division by zero

验证集是空的，从训练集剪切前五条放到验证集

`svc_preprocess_f0.py` rootPath should be changed to 48k?

yes， you can use 48k to get pitch, but parameters should be modified.

Training sample size

more than ten minutes to fine tune pretrain model.