Retrieval-based-Voice-Conversion-WebUI
Retrieval-based-Voice-Conversion-WebUI copied to clipboard
[Feature Request] Nvidia Pytorch implementation of BigVGAN for higher quality and speed
BigVGAN is a Universal Neural Vocoder with Large-Scale Training, it looks very robust even in extreme situations. Nvidia already made a Pytorch implementation of it here : https://github.com/NVIDIA/BigVGAN?tab=readme-ov-file
The audio demos seem to have quite a bump in quality over all other techniques : https://bigvgan-demo.github.io/
Would it be possible to implement it?