VoiceConversionGANs

A series of GAN models for voice conversion on the VCC2018 dataset

This is a voice conversion repository that includes cyclegan-vc, stargan-vc, stargan-vc2 and several variants.

This work is still in progress; more GAN models will be added.

This work is based on the stargan-vc, stargan-vc2 and cyclegan-vc repositories.

Requirements:

Python3
PyTorch 0.4.1
Pyworld

Models:

stgan: stargan-vc1 from https://github.com/liusongxiang/StarGAN-Voice-Conversion
stgan2: stargan-vc2 from https://github.com/SamuelBroughton/StarGAN-Voice-Conversion-2
stgan1_cin: stargan-vc1 + generator with conditional instance normalization + speaker classifier
stgan2_new: stargan-vc2 + patchgan discriminator + only target condition in generator and discriminator + no speaker classifier + gradient penalty
stgan2_ls: stargan-vc2 + projection discriminator (as in the paper) + source and target conditions in generator and discriminator + LSGAN adversarial loss
cycgan: cyclegan-vc1
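
For reference, the LSGAN adversarial loss listed for stgan2_ls replaces the usual cross-entropy objective with a least-squares one. The following is a minimal sketch in plain Python, assuming the common formulation with target 1 for real samples, target 0 for generated samples and a 0.5 scaling factor. The function names are hypothetical and are not taken from this repository, whose implementation may differ in targets or scaling.

```python
def lsgan_d_loss(real_scores, fake_scores):
    """Discriminator loss: push scores of real samples to 1 and of generated samples to 0."""
    real_term = sum((s - 1.0) ** 2 for s in real_scores) / len(real_scores)
    fake_term = sum(s ** 2 for s in fake_scores) / len(fake_scores)
    return 0.5 * (real_term + fake_term)


def lsgan_g_loss(fake_scores):
    """Generator loss: push discriminator scores of generated samples to 1."""
    return 0.5 * sum((s - 1.0) ** 2 for s in fake_scores) / len(fake_scores)
```

Both functions take lists of real-valued discriminator outputs; in a training script the same expressions would be applied to tensors.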

Preprocess:

./run_pre.sh

Modify the script to match your own conda environment and hyperparameters.

Train:

./run_train.sh

Modify the script to match your own conda environment and hyperparameters.

Convert:

./run_convert.sh

Objective Evaluation:

./run_eval.sh

This evaluation calculates Mel Cepstral Distortion (MCD) and Modulation Spectral Distance (MSD), as in the stargan-vc2 paper.

However, this script does not reproduce the scores reported in the paper.
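
To make the MCD metric concrete, here is a minimal sketch using the commonly cited definition: (10 / ln 10) * sqrt(2 * sum of squared coefficient differences), averaged over frames. It assumes the two sequences are already time-aligned and that the 0th (energy) coefficient is excluded. The function name is hypothetical, and run_eval.sh may differ in alignment, coefficient range or averaging.

```python
import math


def mel_cepstral_distortion(ref_frames, conv_frames):
    """Mean MCD in dB over time-aligned frames of mel-cepstral coefficients.

    Each argument is a list of frames; each frame is a list of coefficients.
    The 0th coefficient of every frame is skipped (assumption: it encodes energy).
    """
    if len(ref_frames) != len(conv_frames) or not ref_frames:
        raise ValueError("expected two non-empty, equally long frame sequences")
    scale = 10.0 / math.log(10.0) * math.sqrt(2.0)
    total = 0.0
    for ref, conv in zip(ref_frames, conv_frames):
        squared_diff = sum((r - c) ** 2 for r, c in zip(ref[1:], conv[1:]))
        total += scale * math.sqrt(squared_diff)
    return total / len(ref_frames)
```

Differences in any of these choices (alignment method, number of coefficients, silence handling) change the resulting value, which is one possible reason scores from different scripts do not match.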
