AutoPST icon indicating copy to clipboard operation
AutoPST copied to clipboard

Missing basic execution with different set of speakers.

Open sainishalini opened this issue 2 years ago • 4 comments

Hi there, I am trying to follow the code with my own dataset and could run Main_1.py and main_2.py to get xxx-A.ckpt and xxx-B.ckpt files. Now I am not able to understand to run the demo file to prepare specific speakers dictionary to create and convert. Any help is appreciated with a little more direction to follow the steps.

sainishalini avatar Jun 12 '22 18:06 sainishalini

What matters is the model's required inputs. The model requires source cepstrum, source cepstrum lengths, masks made from source ceptrum lengths, and target speaker embedding. That's what the dictionary provides. You don't need to use a dictionary as long as you have the required inputs.

auspicious3000 avatar Jun 13 '22 18:06 auspicious3000

Thanks! I think, I may be missing something basic. Not clear on what to do next after training the model? How to create or define the sources speakers and target speakers? how to get the converted samples? Thanks for your help!

sainishalini avatar Jun 13 '22 18:06 sainishalini

You are welcome. For the basics, please refer to the paper. https://arxiv.org/abs/2106.08519

auspicious3000 avatar Jun 13 '22 21:06 auspicious3000

thanks, let me check again. I was hoping to get more straight forward direction and commands to run the code for the conversion by providing any set of source and target speakers through command line or modifying the demo code. Got lost on it after training the model.

sainishalini avatar Jun 13 '22 21:06 sainishalini