ZeroSpeech icon indicating copy to clipboard operation
ZeroSpeech copied to clipboard

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Results 11 ZeroSpeech issues
Sort by recently updated
recently updated
newest added

Bumps [numpy](https://github.com/numpy/numpy) from 1.18.2 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...

dependencies

hi,i want to know the speaker id of the raw audio because i want to get the output without the speaker conversion

I set the directory for data as datasets/2019/english, when I run the script preprocess.py, it raises keyerror: 'accessing unknown key in a struct: dataset.in_dir' but I can't find how to...

In config/training/default.yaml file the batch_size is defined as 52. In general, the batch_size value will be power of 2. For example 8, 16, 32, 64, 128 so on. training: batch_size:...

Hi, very appreciate for your great jobs!! However, I'd like to train your VQVAE on my own dataset, whose sampling rate is 8 kHz. So.... would you please tell me...

Hi! I tried to train your model on VCTK dataset. But i think, that I have slow train speed: 15 minutes for 1 epoch: 28k samples with 25 batchsize train...

I want to use it on cpu do real time voice conversion, I don't know whether the model fits my task.

Thank you for sharing your git. My question is same above. Does this work with the unseen speech?

I am interested in vocal cloning from van der Oord et al and found this repo. I try to install on a Google Colab and get: ``` !pip install -r...