knn-vc
knn-vc copied to clipboard
Using a Generator Network Before the Vocoder
Hi,
First of all, great work on this project!
I have a question regarding the architecture. What would happen if you introduced a generator network before the vocoder to generate mel spectrograms, and then trained the generator while using a pre-trained vocoder? I'm curious about how this approach might affect the performance and quality of the generated audio.
Looking forward to your thoughts on this.