Josh
Josh
I know this is about a month late, but it's failing because `-` is an invalid character for a variable in this Mustache library. I can't find any indication of...
A couple notes on this based on my own experiences/preferences that you can take or leave: - This is indeed an incredibly helpful addition to a TTS system, IMO necessary...
I noticed that `data_utils_old` includes a speaker ID along with the audio data. I don't see any usage of speaker ID in the version of `train` that got committed; was...
> As can be seen the model has nowhere to consume speaker id (it does not have an embedding table), it's pointless to pass a speaker id. Yeah, that makes...
The first thing I tried was finetuning the model for a couple thousand steps with some different data. I got reasonable results, but nothing groundbreaking. Challenging cases like the one...
> accent is part of content. I can tell that's the case here, but it strikes me as a strange definition of the word "content". Content should be _what_ is...
If your dataset is sufficiently diverse, I think that kind of pitch issue is inevitable with FreeVC—the pretrained version might seem better because VCTK is less diverse than your data....