become-yukarin
Become Yukarin: Convert your voice to favorite voice
Become Yukarin is a repository for voice conversion with a deep learning model. By training on a large amount of paired audio of the original voice and the favorite voice, the model learns to convert the original voice into the favorite voice.
Japanese README
Supported environment
- Linux OS
- Python 3.6
Preparation
# install required libraries
pip install -r requirements.txt
Training
To run a Python script for training, set the environment variable PYTHONPATH so that Python can find the become_yukarin library.
For example, you can execute scripts/extract_acoustic_feature.py with the following command:
PYTHONPATH=`pwd` python scripts/extract_acoustic_feature.py ---
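The same invocation can also be scripted from Python. The following is a minimal sketch, assuming it is run from the repository root; the helpers `env_with_pythonpath` and `run_script` are hypothetical names used for illustration and are not part of become_yukarin.

```python
import os
import subprocess
import sys
from pathlib import Path


def env_with_pythonpath(repo_root, base_env=None):
    """Return a copy of the environment with repo_root prepended to PYTHONPATH."""
    env = dict(os.environ if base_env is None else base_env)
    root = str(Path(repo_root).resolve())
    existing = env.get("PYTHONPATH")
    # Prepend so the repository copy of become_yukarin is found first.
    env["PYTHONPATH"] = root if not existing else root + os.pathsep + existing
    return env


def run_script(script, args=(), repo_root="."):
    """Run a repository script with the repository root importable (hypothetical helper)."""
    cmd = [sys.executable, str(script), *args]
    return subprocess.run(cmd, env=env_with_pythonpath(repo_root), check=True)
```

Calling `run_script("scripts/extract_acoustic_feature.py", args)` would then mirror the shell command, where `args` holds whatever options the script itself expects.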
First Stage Model
- Prepare voice data
  - Put input/target voice data in two directories (with the same file names)
- Create acoustic feature
  - scripts/extract_acoustic_feature.py
- Train
  - train.py
- Test
  - scripts/voice_conversion_test.py
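The first-stage requirement that the input and target directories contain files with the same names can be checked before extracting features. The following is a minimal sketch using only the standard library; `check_paired_dirs` is a hypothetical helper, not part of the repository, and the `*.wav` pattern is an assumption about the audio format.

```python
from pathlib import Path


def check_paired_dirs(input_dir, target_dir, pattern="*.wav"):
    """Compare file names in two directories.

    Returns (paired, only_in_input, only_in_target) as sorted lists of names.
    """
    input_names = {p.name for p in Path(input_dir).glob(pattern)}
    target_names = {p.name for p in Path(target_dir).glob(pattern)}
    paired = sorted(input_names & target_names)
    only_in_input = sorted(input_names - target_names)
    only_in_target = sorted(target_names - input_names)
    return paired, only_in_input, only_in_target
```

Any names reported in the second or third list have no counterpart in the other directory and would need to be recorded or removed before training.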
Second Stage Model
- Prepare voice data
  - Put input/target voice data in two directories
- Create acoustic feature
  - scripts/extract_spectrogram_pair.py
- Train
  - train_sr.py
- Test
  - scripts/super_resolution_test.py
- Convert other voice data
  - Use SuperResolution class and AcousticConverter class
  - sample code
Reference
- ipynb branch: Other sample code
- Commentary Blog (Japanese)
- Realtime Yukarin: Real-time voice conversion system
License
MIT License