George Grigorev
thank you, `pip install pytorch-lightning==1.0.8 omegaconf==2.0.0` helped. For some reason `conda env create -f environment.yaml` didn't install some of the required libs
have you tried training on LJSpeech or your own dataset? How many iterations are needed compared with HiFi-GAN? Do you have checkpoints somewhere?
got it, thanks
tried it out. I compared the publicly available universal v1 HiFi-GAN (trained for 2.5M iterations on VCTK) with this one trained for 150k iterations on the new Hi-Fi TTS dataset (5 times more data)....
3x RTX 3090 with batch size 16, but I can confirm that Fre-GAN trains much faster than HiFi-GAN
@EmreOzkose I guess that's because the author shared the discriminator only for the universal model
@balag59 same here. But I generated mels from a TTS model. After 50k iterations the overall audio quality increased, but the speaker identity was lost. I guess I need to train a lot longer...
@jik876 I guess that's not normal and I should gather more data?
I had the same issue when SD2 first came out. I believe it has something to do with the v_prediction of the 768 model. I tried to modify the sampler in the DreamBooth training script but...
alright. before: https://voca.ro/1meOfwM2dIEw after: https://voca.ro/11L19CihKeI6