Using a GAN to enhance the spectrograms

Open pineking opened this issue 7 years ago • 2 comments

A post on the Google Research blog says:

Most neural text-to-speech (TTS) systems produce over-smoothed spectrograms. When applied to the Tacotron TTS system, a GAN can recreate some of the realistic-texture, which reduces artifacts in the resulting audio.

https://research.googleblog.com/2017/12/tfgan-lightweight-library-for.html

pineking, Dec 13 '17 13:12

Did you explore this any further? I am inclined to go this way as well.

Shikherneo2, Jun 08 '18 22:06

Any update on this?

nukes, Feb 10 '21 13:02