autovc
autovc copied to clipboard
Follow-up work available for viewing
We have further improved AutoVC in 2 subsequent works.
The 1st work improves the audio quality by removing any pitch artifacts.
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder https://arxiv.org/abs/2004.07370
The 2nd work can convert rhythm, pitch, and/or timbre at the same time.
Unsupervised Speech Decomposition via Triple Information Bottleneck https://arxiv.org/abs/2004.11284
Thats very impressive!
Are their plans to release the models for these new works?
Most likely, but it will probably take a long time. All these works are the intellectual properties of companies. As we know, most companies are typically very strict about releasing code.
Understandable. Again great work and a real milestone in the field!
Thanks! Nevertheless, these works can all be reproduced based on this repository.
The second work is of particular interest, adding emotions to synthesized speech is still rather hit-and-miss.
Your work has been amazing!
@qq547276542 Thanks! More follow-up works will be released. Stay tuned.