autovc

Follow-up work available for viewing

Open auspicious3000 opened this issue 5 years ago • 7 comments

We have further improved AutoVC in 2 subsequent works.

The 1st work improves the audio quality by removing any pitch artifacts.

F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder https://arxiv.org/abs/2004.07370

The 2nd work can convert rhythm, pitch, and/or timbre at the same time.

Unsupervised Speech Decomposition via Triple Information Bottleneck https://arxiv.org/abs/2004.11284

Apr 24 '20 05:04 auspicious3000

That's very impressive!

Are there plans to release the models for these new works?

Apr 26 '20 09:04 ibraheem-nt

Most likely, but it will probably take a long time. All these works are the intellectual property of companies, and most companies are very strict about releasing code.

Apr 26 '20 09:04 auspicious3000

Understandable. Again, great work and a real milestone in the field!

Apr 26 '20 09:04 ibraheem-nt

Thanks! Nevertheless, these works can all be reproduced based on this repository.

Apr 26 '20 09:04 auspicious3000

The second work is of particular interest; adding emotions to synthesized speech is still rather hit-and-miss.

May 05 '20 20:05 Immortalin

Your work has been amazing!

May 26 '20 06:05 qq547276542

@qq547276542 Thanks! More follow-up works will be released. Stay tuned.

May 26 '20 06:05 auspicious3000