NewEricWang

Results 11 comments of NewEricWang

I also find the reverberation of the generated speech become more obvious along with the train iteration increasing.

@switchzts , Its average length is about 4s~5s. It is a single speaker database.

@switchzts , num_layers=10 in the model. The total time is about 12h. The train loss is about 11.0.

What is your input? Do you give model conditional input?

You can try to verify it by using some shorter sentences.

@begeekmyfriend , thanks for your help! I will try your proposed method.

> I had the same issue with 2xA6000 setup. Forcing training on a single GPU fixed it. How do you force training on a single GPU when running "python finetune.py...

> os.environ["CUDA_VISIBLE_DEVICES"] = "0" In my case, it don't work. I add the above code into "finetune.py". When running "python finetune.py", all GPU still are used.