tacotron using GAN to enhance the spectrograms

using GAN to enhance the spectrograms

Open pineking opened this issue 7 years ago • 2 comments

A post on googleblog says:

Most neural text-to-speech (TTS) systems produce over-smoothed spectrograms. When applied to the Tacotron TTS system, a GAN can recreate some of the realistic-texture, which reduces artifacts in the resulting audio.

https://research.googleblog.com/2017/12/tfgan-lightweight-library-for.html

Dec 13 '17 13:12 pineking

Did you explore this any further? I am inclined towards going this way as well..

Jun 08 '18 22:06 Shikherneo2

Any update on this?

Feb 10 '21 13:02 nukes

tacotron tacotron copied to clipboard

using GAN to enhance the spectrograms

tacotron
tacotron copied to clipboard