FastDiff icon indicating copy to clipboard operation
FastDiff copied to clipboard

PyTorch Implementation of FastDiff (IJCAI'22)

Results 17 FastDiff issues
Sort by recently updated
recently updated
newest added

Hey @Rongjiehuang, Thanks a lot for open-sourcing the checkpoint for the FastDiff vocoder for LJSpeech! I played around with the code a bit and I'm only getting quite noisy generations...

I noticed that the supported datasets of this repository have different audio sample rate. Is the output sample rate of FastDiff vocoder bound to its training data, or does it...

Hello,than you for for awesome work. When I run your script of > Inference for text-to-speech synthesis in ReadMe, but got an error: > Traceback (most recent call last): File...

Are there any experiments done to evaluate the performance of FastDiff on singing voice datasets?

I was trying to run the repo on colab: - Inference for text-to-speech synthesis - Inference from wav file using the commands given in the ReadMe file but I am...

![image](https://user-images.githubusercontent.com/81682187/174861312-db407221-afd8-4b8b-9ce9-70806f9790c4.png) Hello, I want to finetune the model with my own dataset I want to understand more about how data structure should be in the following folder : raw_data_dir, processed_data_dir,...

Hello I was following the demo, and I ran into some trouble with the yaml files: so I tried grabbing many parameters from `PortaSpeech/diffspeech/config.yaml` and add them to `modules/FastDiff/config/FastDiff.yaml`, but...

Hello I'm trying to implement noise scheduling process refer to BDDM's implementation [BDDM/sampler.py](https://github.com/tencent-ailab/bddm/blob/main/bddm/sampler/sampler.py) And I have some question for noise scheduling process for FastDiff-TTS. 1. In the Fastdiff paper, the...

How about fastdiff's support for Chinese voice? Did you do experiments?

From the paper, it seems like a FastSpeech2 model was trained end-to-end in combination with the diffusion vocoder. Are you planning on releasing its weights as well? This would be...