K Rones

Results 15 comments of K Rones

This caught me out for ages. Try including at least 32 wav files. I'm not sure why/where this is specified as a requirement. But I came across this in a...

I'm afraid I'm still learning, so can't offer much advice in that regard. However, I do think you need to make sure that the language specified in the preprocessing matches...

If you want to test the pipeline you can just copy paste your existing files to make duplicates till you have 32. Example, just using the first 3: wav/000.wav|male1|Одата Левски...

Ah, try removing the "wav/" prefix in the metadata. It is not required assuming you are giving a data directory where the csv and wav folder are both located and...

Are you specifying single speaker?: python3 -m piper_train.preprocess --language en-us --input-dir /path/to/dataset_dir/ --output-dir /path/to/training_dir/ --dataset-format ljspeech **--single-speaker** --sample-rate 22050 If so, I don't think it expects a speaker column. Sorry,...

I'm afraid I'm all out of ideas. It could be a mismatch between the model and the training data that we're not seeing. Can you link the hugginface link for...

Apologies for maybe confusing things but I raised an issue over on the Huggingface page that I think may be related: https://huggingface.co/datasets/rhasspy/piper-checkpoints/discussions/8 In short, I'm trying to use the [libritts_r](https://huggingface.co/datasets/rhasspy/piper-checkpoints/tree/main/en/en_US/libritts_r)...

From my HuggingFace comment: It's the Speaker count. From the config.json of the model: "num_speakers": 904, I went through my training data. Duplicated it until there were 904 instances. Then...

Just to add a note of interest as highlighted by a reddit user: The last two images show new generations I did in both installs just for this post to...

> I will take a look this next time I am on forge dev what will you get if you use the info in original newest a1111? So this might...