Why does this network ask to enter any spectrogram at the last stage?

Open etimsijs opened this issue 4 years ago • 2 comments

Hello, could you please tell me why the network asks for a spectrogram as input when it outputs the result? I mean this command: python inference.py -p [checkpoint path] -i [input mel path]. Usually, GANs generate their output from random noise that they sample themselves, so why does the network need a mel spectrogram to output the result?

etimsijs avatar Feb 13 '21 20:02 etimsijs

@etimsijs Not in the case of vocoder GANs like MelGAN, VocGAN or TFGAN. In these GANs we take the mel spectrogram as input and directly upsample it along the time axis by a factor of the hop size to get the audio waveform. We don't condition on noise; we use the mel spectrogram directly.
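
As a rough illustration of the length relationship described above (not the actual TFGAN code), the sketch below expands each mel frame into hop-size output steps. The values of `n_mels`, `hop_size` and `num_frames` are assumed for the example, and the helper names are hypothetical. A real vocoder generator would use learned upsampling layers rather than plain repetition.

```python
# Illustrative sketch only: this is NOT the TFGAN implementation.
# n_mels, hop_size and num_frames are assumed example values.


def expected_num_samples(num_frames: int, hop_size: int) -> int:
    """Number of output time steps when every mel frame becomes hop_size steps."""
    return num_frames * hop_size


def upsample_frames(mel, hop_size):
    """Naive nearest-neighbour upsampling along time.

    Each frame is repeated hop_size times. A real vocoder generator
    replaces this with learned layers (for example transposed convolutions)
    and maps the result to a single-channel waveform.
    """
    return [frame for frame in mel for _ in range(hop_size)]


n_mels = 80        # assumed number of mel bins
hop_size = 256     # assumed hop size used when the mel was computed
num_frames = 100   # assumed length of the input mel in frames

# Dummy mel spectrogram: num_frames frames, each with n_mels values.
mel = [[0.0] * n_mels for _ in range(num_frames)]

upsampled = upsample_frames(mel, hop_size)
print(len(upsampled), expected_num_samples(num_frames, hop_size))
```

The point is only that the output length is tied to the input mel: the mel spectrogram, not a noise vector, determines what audio is produced and how long it is.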

rishikksh20 avatar Feb 16 '21 16:02 rishikksh20

So, is this network for audiophiles? Can it, let's say, increase the sampling rate from 22 kHz to 41 kHz? Is it not for generating audio?

etimsijs avatar Feb 17 '21 15:02 etimsijs