Retrieval-based-Voice-Conversion-WebUI
Retrieval-based-Voice-Conversion-WebUI copied to clipboard
Hubert (latent variables) to VITS?
The VITS project contains posterior encoder which converts audio to latent space variables. But HuBERT does the same.
Does RVC work by generating latent space variables with HuBert and than use it for voice conversion in VITS?
Thank you for answer.