kalomaze
kalomaze
This would solve the problem of coming across a channel that formats their song uploads as Song - Artist rather than Artist - Song, I have to manually correct these...
When pre-processing the wavs, they are all normalized to keep the dataset even and consistent. This makes sense to me as a method to ensure data processed is relatively similar,...
A common complaint from Google Colab users is harvest being unusable without manually splitting their song into parts. I wonder if an argument in the command to run python infer-web.py...
I've been told contentvec hubert was trained at 44khz, but there are 3 options in RVC for 40khz, 48khz, and 32khz. This confuses me, as the standard for most recorded...
There seems to be a bug where pth files will a lot of the times save with this number of steps in the filename, which can be confusing and misleading....
I totally understand if this is out of the scope of RVC as a project in general, and I don't have the technical qualifications to say if it would be...
[v1_vs_v2_Examples.zip](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/files/11490148/v1_vs_v2_Examples.zip) 5 minute dataset. I also used the [Mangio fork which adds 'crepe' as a training option](https://github.com/Mangio621/Mangio-RVC-Fork) for both of these models. Maybe for a model with a bigger dataset...
I noticed that the default value seems to have changed again. I would like to know the reasoning because despite messing with the other option for voiceless consonant and crepe...
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/assets/66376113/ccc991bc-f614-4753-a533-ab0afdd017cf If you check the preprocessed wavs folder before doing feature extract, you can notice that they are not properly aligned when put up next to each other back to...