Auffusion icon indicating copy to clipboard operation
Auffusion copied to clipboard

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Results 2 Auffusion issues
Sort by recently updated
recently updated
newest added

I tested and found that the duration of the output audio is always 10 seconds. How to modify the code to make the output audio duration consistent with the input...

Hi, do you directly use the pre-trained VAE in LDM? Or the VAE is first pre-trained on audio spec? Thank you very much.