Auffusion
Auffusion copied to clipboard

→

Metadata

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Reame
Issues

Results 2 Auffusion issues

Sort by recently updated

Can I control the duration of theText-guided style transfer's output audio?

I tested and found that the duration of the output audio is always 10 seconds. How to modify the code to make the output audio duration consistent with the input...

hello-xiaow

About pre-trained VAE

Hi, do you directly use the pre-trained VAE in LDM? Or the VAE is first pre-trained on audio spec? Thank you very much.

kaiw7

← Metadata

119

Stars

Forks

Watchers

Owner

happylittlecat2333

Metadata

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Back

Auffusion Auffusion copied to clipboard

Metadata

Can I control the duration of theText-guided style transfer's output audio?

About pre-trained VAE

← Metadata

Owner

Metadata

Auffusion
Auffusion copied to clipboard